How many Altera LE's to Xilinx Slices????

Question

Hello All,

I've been designing with Xilinx FPGAs for a while, so I'm used to the "Slice" concept. I'm looking at Altera's Max II as a nice possible solution for a design.

I took my VHDL code and it synthesized to 40 Slices in a Spartan III. Then I took the same code and synthesized it for a Max II (using Quartus II now) and it came to 71 LEs.

I realize that a blanket statement like "71 LEs is approximately 40 Slices" is totally dependent on how the code is synthesized. But is an approximate 1 Slice = 2 LEs a pretty close all-around estimate?

Thanks,
Eric

Ben Twijnstra · Accepted Answer

Hi Eric,

Give or take ~10% as a design-dependent margin and you should be OK.

Best regards,
Ben

rickman · Answer

The problem is not a hardware issue, but a granularity issue. Slices are not a good measure of how much logic your design is using. A slice has two LUTs and two FFs. If even one FF is used, the slice is counted as used. You are better off determining how many LUTs and FFs are used in each design. Those numbers are much more comparable, although there will be family-dependent differences in how well the designs pack into the larger granules. Mostly, the newer parts will pack logic and FFs more densely than the older parts.

Rick "rickman" Collins
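To make the granularity point concrete, here is a minimal Python sketch. It assumes the simplified model described above (two LUTs and two FFs per slice, and a slice counts as used if anything in it is used). The helper name `slice_count_bounds` and the example numbers are hypothetical, not taken from any vendor tool or report.

```python
import math

def slice_count_bounds(luts, ffs, luts_per_slice=2, ffs_per_slice=2):
    """Return (best_case, worst_case) slice counts for a design.

    Simplified model: a slice counts as used if any LUT or FF in it is used.
    """
    # Best case: every slice is filled as completely as possible.
    best = max(math.ceil(luts / luts_per_slice), math.ceil(ffs / ffs_per_slice))
    # Worst case: every LUT and every FF lands in a slice of its own.
    worst = luts + ffs
    return best, worst

# Illustrative numbers only: the same 60 LUTs and 30 FFs could be
# reported as anywhere between 30 and 90 slices.
print(slice_count_bounds(luts=60, ffs=30))  # (30, 90)
```

The wide range is why comparing raw LUT and FF counts tends to be more meaningful than comparing slice counts.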

H. Peter Anvin · Answer

Well, given that 1 slice = 2 LUTs + 2 FFs + some more logic, and 1 LE = 1 LUT + 1 FF + some more logic, it would be expected.

-hpa

Paul Leventis (at home) · Answer

Hi Eric,

Yes, that's a good first-order estimate. We believe that 1 Slice is equal to about 1.8 LEs based on average results across a suite of designs, but mileage will vary from design to design. This lines up well with your result, though.

One thing you should do is ensure that the CAD tool is trying to use as few LEs (and slices for Xilinx) as possible. When you are not filling up the device, Quartus will not try too hard to put LUTs and FFs into the same LE: if there's any chance it will hurt rather than help timing, it will avoid it. When you start filling the device close to capacity, Quartus will try to pack more aggressively. This is the default "auto" setting for register packing.

To artificially force Quartus to pack as aggressively as possible into LEs, go to the menu Assignments/Settings..., select the Fitter Settings tab, and click the "More Settings..." button. There is a setting called "Auto Packed Registers -- Max II". Setting this to Minimize Area w/Chains will cause the most...
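As a rough illustration of how such a ratio might be used, the sketch below turns a slice count into an LE estimate with an optional margin. The function name `estimate_les` is hypothetical; the 1.8 and 2.0 ratios and the ~10% margin are simply the figures quoted in this thread, not guaranteed values.

```python
def estimate_les(slices, les_per_slice=2.0, margin=0.10):
    """Return (low, nominal, high) LE estimates for a given slice count."""
    nominal = slices * les_per_slice
    return nominal * (1 - margin), nominal, nominal * (1 + margin)

slices, measured_les = 40, 71  # figures from the original question

# Rule of thumb from the question, with a +/-10% design-dependent margin.
print(estimate_les(slices))

# Average ratio quoted above, with no margin applied.
print(estimate_les(slices, les_per_slice=1.8, margin=0.0))

# Ratio actually measured for this one design.
print(measured_les / slices)  # 1.775
```

Treat the result as a sizing hint only; running the design through both synthesis tools remains the reliable check.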

Walter Gallegos · Answer

In reply to "Guitarman":

I disagree. The two architectures are different; you can't compare them in this way.

How many slices does the following code take?

    .....
        DI : in std_logic;
        DO : out std_logic;
        CLOCK : in std_logic;
    ............
       signal temp: std_logic_vector(15 downto 0);
    ......
    begin
       Demo : process(CLOCK)
       begin
          if rising_edge(CLOCK) then
             temp

Hal Murray · Answer

What would make the timing better if the LUT and FF are not packed in the same LE?

I'm assuming that there is a very good path connecting the LUT/FF in the same LE because it is such a common case. What makes not using that faster?

rickman · Answer

He is not talking about a LUT and FF that are connected; he means ones that are separate, such as a FF with its D input connected to the output of another FF, and a LUT whose output goes only to another LUT. Unless there is a shortage of IO in the LAB, they can share the same LE. The same is true of the Xilinx slice. Due to crowding of the routing, it may result in a faster design to keep them separate.

Rick "rickman" Collins

Paul Leventis (at home) · Answer

Hi Hal, Rick:

Rick's got it mostly right. The Stratix/Cyclone/Max II LE/ALMs can have a number of register/LUT pairings:

1. LUT feeds FF
2. FF feeds LUT
3. Unrelated FF and 3-input LUT
4. FF->FF connection from adjacent LE and a 4-input LUT (a register chain)

For example, we could pack an 8-bit shift register in with seven 4-LUTs and one 3-LUT to form 8 LEs.

As Hal observed, it seems like doing #1 (or #2) is always a win. If you look at one FF, in our architecture we can choose to pack it with its fan-in (#1) or fan-out (#2). For example, if the critical path of the design is on the output of the FF, through only one of its LUTs, using packing #2 is the better choice for that flop. So there is an interesting optimization problem here.

Some of the LEs created by #1 or #2 will have two separate LE outputs (the flop and the LUT) in the event that the FF/LUT connection is not single fanout. In theory, these multiple-output LEs create a bit more routing pressure and so you may hurt timing more by making one...
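The LE arithmetic behind the shift-register example can be sketched in a few lines of Python. This is a simplified counting model, assuming each LE holds at most one LUT and one FF; the function `le_count` is hypothetical and says nothing about how the Quartus fitter actually decides what to pack.

```python
def le_count(luts, ffs, packed_pairs):
    """LEs used when `packed_pairs` LUT/FF pairs share an LE.

    Each LE holds at most one LUT and one FF, so every packed pair
    saves one LE compared with placing the LUT and the FF separately.
    """
    if packed_pairs < 0 or packed_pairs > min(luts, ffs):
        raise ValueError("packed_pairs must be between 0 and min(luts, ffs)")
    return luts + ffs - packed_pairs

# The example above: 8 shift-register FFs plus 8 unrelated LUTs.
print(le_count(luts=8, ffs=8, packed_pairs=8))  # 8  (fully packed)
print(le_count(luts=8, ffs=8, packed_pairs=0))  # 16 (nothing packed)
```

The same logic can therefore be reported as anything from 8 to 16 LEs depending on how aggressively the fitter packs, which is why the packing setting matters when comparing LE counts.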

Nicholas Weaver · Answer

Not just routing, but also placement: the separate pieces (FFs, LUTs, etc.) are not placed independently, but are packed together and then placed. Thus, if unrelated logic is packed together inappropriately, the placement for the packed component may be significantly worse than if each component were placed separately.

Walter Gallegos · Answer

The answer is:

    1 slice in a Spartan 3
    16 LEs in a MAX-II

Can you compare these architectures as 1 Slice = 2 LEs?

Walter.
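For comparison, here is a small Python sketch that puts the two data points reported in this thread side by side. The helper `les_per_slice` is hypothetical. A likely reason for the shift-register result, not stated in the post, is that a Spartan-3 LUT can be configured as a 16-bit shift register (SRL16), whereas a MAX II LE provides one register per LE.

```python
def les_per_slice(slices, les):
    """Measured LE-to-slice ratio for one design."""
    return les / slices

# (slices, LEs) pairs reported in this thread.
reported = {
    "Eric's design": (40, 71),
    "16-bit shift register": (1, 16),
}

for name, (slices, les) in reported.items():
    print(f"{name}: {les_per_slice(slices, les):.3f} LEs per slice")
```

The spread between the two ratios shows how far a single design can sit from any average figure.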

Arash Salarian · Answer

I agree that there are some areas where you can't simply compare the two architectures. For example, I had an old design on an Altera 10K series part that used a fully async RAM block. Move it to a Spartan 3 architecture and you'll find you need the whole chip just to build that block of async RAM!

However, it is perfectly understandable that a user might need to compare the available options, and to do this he or she needs rough estimates for comparing a Xilinx device to an Altera one. For example, I recently had an interesting offer for an FPGA prototype board at the same price of $99 with either an Altera EP1C12 or a Xilinx XC3S400. I would like to use a prototype board for very different designs, so I had to compare the two chips. As I program in VHDL and use synthesis tools, I don't really care about any specific architecture (unless something like your example or my example above happens), and what matters in cases like that is simply finding the BIGGER FPGA. To do that you need to compare, and to compare you can only use rough estimates.

Personally, I find the simple equation of 1 Slice = 2 LEs a very good rough estimate, and for many designs it gives you a good answer. You have a very specific design and need a very good answer? Fire up your synthesis tool and see how many resources you'd really need!
