How To Synchronize FPGAs

Question

Hello newsreaders,For a while I have been confronted with the following task which I findquite challenging but unfortuantely didn't manage to solve it, yet.What I want to do is to use 2-4 FPGAs (Xilinx Virtex 2 Pro) together on oneprinted circuit board (PCB). They are used to process a large amount ofincoming serial data (data rates of several GHz's). My idea is to handlethat data parallel by the 2-4 FPGAs. But now there arises the problem how toadequately split the data and how to synchronize the FPGAs among oneanother, in particular?Is it possible or first of all a realistic idea to synchronize multipleFPGAs in the GHz range? How can this be done without much protocolloverhead? I would like to do it without applying an extra transfer protocollamong the FPGAs just for that purpose! Up to this date I didn't find aproper solution, yet.Maybe someone can give me a hint? Any ideas how to solve that problem?Regards,    Leroy Tanner

Don Golding · Accepted Answer

Maybe I am missing something, but wouldn't you just drive all the chips with one onboard clock then in your code trigger the processes on the rising edge?Don

Josh Model · Answer

Post Below...withhowStart Post....It gets tricky when you have multiple FPGAs clocked at hundred(s) of MHz.  Idon't have any direct expeience there, but I think looking for appnotes onvendor sites that address "Board Level De-skew" (using FPGA clockingresources to account for clock distribution headaches) and specifically forXilinx, "Channel bonding" (using multiple RocketIO transceivers to receivedata in parallel).   The RocketIO transceivers are difficult beasts, atleast if you're not  using a standard protocol.  I'm not sure if the channelbonding can span multiple V2pro devices, but I know it can span multipletransceivers.Not sure on your budget, or application requirements, but it may beworthwhile going to a single, larger part that contains the resources youneed.  It at least partially removes the headache of high-speed PCBdesign/layout.--Josh Model

Symon · Answer

...or at least take all the high speed serial stuff into one FPGA anddistribute it from that one to the others at a slower parallel rate. Also,it looks like V4 could take care of this with its ChipSync thingy for sourcesynchronous application.Cheers, Syms.

rickman · Answer

Yes, you *are* missing something...  ;)D-- Rick "rickman" CollinsIgnore the reply address. To email me use the above address with the XYremoved.Arius - A Signal Processing Solutions CompanySpecializing in DSP and FPGA design      URL  King Ave                               301-682-7772 VoiceFrederick, MD 21701-3110                 301-682-7666 FAX

glen herrmannsfeldt · Answer

>  But now there arises the problem how toI believe most important is to first latch the signals in the IOBto minimize clock skew problems.   Otherwise, an external shiftregister to generate bit parallel signals for input to the FPGA.-- glen

Leroy Tanner · Answer

"Symon" :ok, I agree on that and it might be a good approach to minimize skewing inthe first section. but nevertheless I must synchronize the other FPGAs toeach other, not at a rate of several GHz but say at ca. 300 MHz. In myopinion a central clock isn't an appropriate solution!?

Josh Model · Answer

Think about what a central clock entails from purely a routing perspective.Let's assume you're an SI wizard, and have no issues there.300 MHz would be ~ 3.3 ns per clock cycle.  If I remember my rule of thumb,you've got about 6 inches per 1 ns for the speed of an electrical signal inFR-4 material.  So the worst case match between all your data lines and allclock lines for all FPGA's will be the skew that eats into your timingbudget.Just as an example (I'm not really a layout person, so it's my posteriorspeaking), matching all lines to 4 FPGAs +/- 3 inches seems relativelytricky, but not completely unreasonable.  So now ~1/3 of your entire clockcycle is wasted (more, if you were assuming DDR) before you even get to theFPGA fabric.  it makes laying out your design that much more tricky.Now, in the slightly more real world you've got to throw in the jitterpresent on a 300 MHz clock, impedance mismatches causing reflections,crosstalk on your board with all that data zipping around (because...

Symon · Answer

Hi Leroy,Say you've got 4 FPGAs A, B, C & D. Each gets fed the 300MHz clock, so onthe fabric of each FPGA is CLK_A, CLK_B etc. When you send data from (say)FPGA B to FPGA D, send a clock with the data, generated by FPGA B from itsinternal CLK_B, called (say) CLK_B_TO_D. Use this source synchronous clockwith a DCM in FPGA D to get the data into a BRAM FIFO inside FPGA D. Get thedata out from this FIFO into the fabric of FPGA D using CLK_D. Repeat forall the other paths. Any good?Cheers, Syms.

dave · Answer

There are two ways to approach this problem: (1) have each FPGA perform a part of the process on the entire data stream or (2) have each FPGA perform the entire process on part of the data stream. We once implemented (2) for a bandwidth expander where each chip did the complete process (one clock cycle Huffman decoding, translation of the code to a value, then arithmetic processing) for a portion of the incoming data stream. Each chip was provided a chunk of the incoming data (e.g., in a two-chip system, chip one processed chunks 1,3,5,... of the data and chip two was processed chunks 2,4,6,... of the data). We actually used two on the board because of I/O bandwidth limitations, but the chip was designed to allow for 1,2,4,or 8 chip operation.

-=Dave=-

How To Synchronize FPGAs

Join the Discussion

Didn't find your answer?