Info on FPGA routing algorithms?

Question

I've been reading papers about routing in island-style FPGAs. Most cite Xilinx architectures, though I've looked at a few papers about H-tree networks. Often, a simplified model is used. There is mention (somewhat dated) that commercial routers use derivatives of maze routing, with some more recent mention of channel routing. Are there any papers that give a good idea of how real industry software does global and detailed routing, and what algorithms are actually used? What is the typical lag between the appearance of an approach in conference/journal papers and its uptake in commercial routers? I'm curious how much I should trust the papers as an indication of actual practice. Also, I am still in rummaging mode, and have yet to come across a paper that shows how the switches in the switch boxes are actually explored to get a detailed routing, given a non-full crossbar. I've looked at

Wu & Tsukiyama et al.: Graph analysis of 2D FPGA routing

but I'm hoping to rummage into something more applied.

Thanks.

Fred

Paul Leventis (at home) · Accepted Answer

Hi Fred,

These days, the best academic routers do combined global + detailed routing using negotiated congestion. The base algorithm employed is known as Pathfinder; see "Placement and Routing Tools for the Triptych FPGA" by C. Ebeling, L. McMurchie, S.A. Hauck, and S. Burns. A widely used tool is VPR (by Vaughn Betz), which performs clustering, followed by placement (via simulated annealing), and then timing-driven, negotiated-congestion-based routing. It uses an improved form of the Pathfinder algorithm, but exactly what improvements it adds I forget, since VPR was commercialized and I no longer remember what was in the original academic version! I *think* that VPR (or some improved version of it) is still the best academic router, quickly providing small channel widths and good timing.

There is a book, "Architecture and CAD for Deep-Submicron FPGAs" by Betz, Rose, and Marquardt, that covers the algorithms employed in VPR and T-VPACK (a timing-driven clusterer by Marquardt). It also covers the...
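For readers who want a feel for what "negotiated congestion" means, here is a minimal Python sketch, not the actual Pathfinder or VPR code. It assumes the commonly described node cost of the form (base + history) * present-congestion penalty, with a base cost of 1 per node. The function names (`shortest_path`, `negotiate`) and the parameters `pres_fac` and `hist_fac` are made up for illustration. Every net is ripped up and re-routed by shortest path each iteration, and nodes that stay overused get steadily more expensive until the nets stop sharing them.

```python
import heapq

def shortest_path(edges, node_cost, source, sink):
    """Cheapest path by node cost; returns a list of nodes or None."""
    heap = [(0.0, source)]
    best = {source: 0.0}
    prev = {}
    while heap:
        c, n = heapq.heappop(heap)
        if n == sink:
            path = [n]
            while n in prev:
                n = prev[n]
                path.append(n)
            return path[::-1]
        if c > best[n]:
            continue  # stale heap entry
        for m in edges.get(n, ()):
            nc = c + node_cost(m)
            if nc < best.get(m, float("inf")):
                best[m] = nc
                prev[m] = n
                heapq.heappush(heap, (nc, m))
    return None

def negotiate(edges, nets, capacity=1, max_iters=20, pres_fac=0.5, hist_fac=1.0):
    """nets: dict name -> (source, sink). Returns dict name -> path, or None."""
    history = {}    # accumulated congestion cost per node
    occupancy = {}  # how many nets currently use each node
    routes = {}
    for _ in range(max_iters):
        for name, (src, dst) in nets.items():
            # Rip up this net's previous route before re-routing it.
            for n in routes.get(name, ()):
                occupancy[n] -= 1

            def node_cost(n):
                # Overuse that would result if this net also took node n.
                over = occupancy.get(n, 0) + 1 - capacity
                present = 1.0 + pres_fac * max(0, over)
                return (1.0 + history.get(n, 0.0)) * present

            path = shortest_path(edges, node_cost, src, dst)
            if path is None:
                return None
            routes[name] = path
            for n in path:
                occupancy[n] = occupancy.get(n, 0) + 1
        overused = [n for n, k in occupancy.items() if k > capacity]
        if not overused:
            return routes  # legal routing: no resource is shared
        for n in overused:
            history[n] = history.get(n, 0.0) + hist_fac * (occupancy[n] - capacity)
        pres_fac *= 2  # sharing gets more expensive every iteration
    return None

# Two nets that both prefer the shared wire "mid"; each also has a private detour.
edges = {
    "a_out": ["mid", "a1"], "a1": ["a2"], "a2": ["a_in"],
    "b_out": ["mid", "b1"], "b1": ["b2"], "b2": ["b3"], "b3": ["b_in"],
    "mid": ["a_in", "b_in"],
}
routes = negotiate(edges, {"A": ("a_out", "a_in"), "B": ("b_out", "b_in")})
print(routes)
```

In this toy graph both nets grab `mid` in the first iteration. Once its history cost has grown, net A, which has the shorter detour, moves off it and net B keeps the shared wire.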

Yttrium · Answer

Thanks for the info; I was looking for the same information.

Fred Ma · Answer

Thanks, Paul,

I did in fact encounter Vaughn Betz's papers on VPR. There is a pretty intuitive description of VPR, and Pathfinder is described as an iterative application of shortest path. I'm still gnawing breadth-first on promising references from web-found papers. I'll look more closely at Pathfinder to see if it details how the switchbox settings are determined. Thanks for the pointer to Vaughn's book.

If anyone can comment on how widespread the various algorithms are, both in academia and industry, that would be helpful. Papers often reference other papers, but don't actually indicate which algorithms are used by which commercial tools, or how prevalent the various commercial tools are. Vaughn's website says Right Track is now part of Altera, so maybe Altera's own tools will start using ideas from VPR. I have yet to come across information about what the algorithm was prior to this. What about Xilinx? Is it the case that they don't like to disclose the actual inner workings...

Paul Leventis (at home) · Answer

Hi Fred,

Regarding how the switchbox settings are determined: I'm not quite sure what you mean by this. If you're referring to how the topology of the switchbox is determined (an architectural decision), there are some other papers I can point you to.

From a routing algorithm perspective, it is really quite simple. You represent the entire chip as a graph where each node represents a routing resource: a block input, block output, or routing wire. Directed edges are placed between these nodes to represent the presence of a (programmable) connection from one resource to another. So a switch box that is not a complete crossbar is encoded in this graph simply by which edges are present. For each net, the router starts at the source block output, traverses the edges from that node, assigns a desirability/cost to each of the nodes seen, and places them in a heap. It then removes the best node from the heap and repeats from there until it hits the desired destination (a block input). This is known as an A* or best-first traversal, and clearly all...
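The graph-plus-heap description above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not code from any real router. The node names (`out_A`, `w0`, `in_B`, ...) and the function `route_net` are hypothetical, every resource defaults to a cost of 1, and no distance-to-target estimate is added, so this is plain best-first (Dijkstra-style) search rather than a tuned A*.

```python
import heapq

def route_net(edges, cost, source, sink):
    """Best-first search over a routing-resource graph.

    edges: dict node -> list of nodes reachable through a programmable switch
    cost:  dict node -> cost of using that resource (defaults to 1.0)
    Returns the cheapest path as a list of nodes, or None if unroutable.
    """
    heap = [(0.0, source)]   # (cost so far, node)
    best = {source: 0.0}     # cheapest known cost to reach each node
    prev = {}                # back-pointers for path recovery
    while heap:
        c, node = heapq.heappop(heap)   # remove the best node from the heap
        if node == sink:
            path = [node]
            while node in prev:
                node = prev[node]
                path.append(node)
            return path[::-1]
        if c > best.get(node, float("inf")):
            continue  # stale heap entry, a cheaper route was already found
        for nxt in edges.get(node, ()):
            nc = c + cost.get(nxt, 1.0)
            if nc < best.get(nxt, float("inf")):
                best[nxt] = nc
                prev[nxt] = node
                heapq.heappush(heap, (nc, nxt))
    return None

# A tiny graph: the "switch box" joins w0 only to w2 and w1 only to w3,
# i.e. it is not a full crossbar, and that shows up purely as missing edges.
edges = {
    "out_A": ["w0", "w1"],
    "w0": ["w2"],
    "w1": ["w3"],
    "w2": ["in_B"],
    "w3": ["in_C"],
}
print(route_net(edges, {}, "out_A", "in_B"))
```

The router never needs to know what a switch box is. The legal switch settings are exactly the edges, and the chosen path tells you which switches to turn on.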

Joseph H Allen · Answer

Do any placement algorithms try to make regular structures? For datapaths, for example? This is how a human would do it, and might give excellent results in some cases.

Although, you certainly want to architect FPGAs with enough routing resources so that regular structures are not needed.


Tom Seim · Answer

1. On optimum switch box designs for 2-D FPGAs. Hongbing Fan; Jiping Liu; Yu-Liang Wu; Chak-Chung Cheung. Design Automation Conference, 2001. Proceedings, 18-22 June 2001. Pages: 203 - 208
2. General models and a reduction design technique for FPGA switch box designs. Hongbing Fan; Jiping Liu; Yu-Liang Wu. Computers, IEEE Transactions on, Volume: 52, Issue: 1, Jan. 2003. Pages: 21 - 30
3. Not necessarily more switches more routability [sic.]. Yu-Liang Wu; Chang, D.; Marek-Sadowska, M.; Tsukiyama, S. Design Automation Conference 1997. Proceedings of the ASP-DAC '97. Asia and South Pacific, 28-31 Jan. 1997. Pages: 579 - 584
4. The effect of switch box flexibility on routability of field programmable gate arrays. Rose, J.; Brown, S. Custom Integrated Circuits Conference, 1990., Proceedings of the IEEE 1990, 13-16 May 1990. Pages: 27.5/1 - 27.5/4
5. General models for optimum arbitrary-dimension FPGA switch box designs. Hongbing Fan; Jiping Liu; Yu-Liang Wu. Computer Aided Design, 2000. ICCAD-2000. IEEE/ACM...

Fred Ma · Answer

I sort of got lost with the heap, but don't worry, I was just getting a rough idea. If needed, I will look up A* (I have papers on it). The representation for wires/switches above matches that in Pathfinder, and I noticed that Betz's VPR also has tricks to cut down some work in restarting the wave front for multiterminal nets. The use of arcs to represent switches seems to get rid of the division between detailed and global routing.
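To make the multi-terminal case concrete, here is a rough Python sketch of the generic textbook approach, which is not necessarily the specific trick VPR uses. After each sink is reached, the wavefront for the next sink is restarted from every node already in the net's route tree at cost zero, so later sinks branch off the existing wiring. The function name `route_multi_sink`, the node names, and the unit cost per node are assumptions for illustration.

```python
import heapq

def route_multi_sink(edges, source, sinks):
    """Route one net from source to several sinks.

    Returns the set of nodes in the route tree, or None if a sink is unreachable.
    """
    tree = {source}
    for sink in sinks:
        # Restart the wavefront from the whole existing tree at cost 0.
        heap = [(0, n) for n in sorted(tree)]
        best = {n: 0 for n in tree}
        prev = {}
        found = False
        while heap:
            c, n = heapq.heappop(heap)
            if n == sink:
                found = True
                break
            if c > best[n]:
                continue  # stale heap entry
            for m in edges.get(n, ()):
                if c + 1 < best.get(m, float("inf")):
                    best[m] = c + 1
                    prev[m] = n
                    heapq.heappush(heap, (c + 1, m))
        if not found:
            return None
        # Walk back from the sink until we rejoin the existing tree.
        n = sink
        while n not in tree:
            tree.add(n)
            n = prev[n]
    return tree

edges = {"out": ["w0"], "w0": ["w1", "in_B"], "w1": ["in_C"]}
print(sorted(route_multi_sink(edges, "out", ["in_B", "in_C"])))
```

Rebuilding the heap from scratch for every sink is the simple version. The work that cleverer routers try to save is exactly this repeated re-expansion from the tree.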

Yes, I was noticing that in recent papers.

Aaawwww. OK. Thanks. I understand why there is a paucity of such information.

Fred

Fred Ma · Answer

Thanks for the references, Tom. I looked briefly at the abstracts of a few, and they are quite relevant. I will look at them in more detail.

Fred

Fred Ma, Dept. of Electronics, Carleton University, Ottawa, Ontario, Canada

Fred Ma · Answer

Joseph, I'm no expert in the area, but I seem to recall seeing this. Look up the authors of the C compilers for Garp; I think one of them works on a fast linear placement algorithm. It might be based on arraying bit slices. I expect the granularity of one's slice to be highly dependent on the target platform's array architecture.

That's the constant balance, it seems: deciding how focused to make your application domain, which determines the suite of applications (and kinds of operations) you need to support, which in turn helps you avoid excessive flexibility in the array architecture. It's pretty vague, but it seems to be the general motivation for reconfigurable logic.

Fred
