Auto pipeline logic??

Question

Hi all,

Pipelining manually in HDL is a hard task. I found that some tools, like Synplify, have pipelining features, but the pipelining they provide just inserts registers between RAM and logic.

My question is: is there a tool to automatically pipeline the logic? For example, I want to pipeline the logic by inserting N registers. And if such a tool exists, what does it modify: the HDL or the netlist?

Best regards,

Davy

Brad Smallridge · Accepted Answer

I think it's Xilinx, or maybe Mentor Graphics, that has a tool called Precision, which is supposed to "push or pull" registers in order to meet timing criteria. I don't know what it costs.

b r a d @ a i v i s i o n . c o m

Ben Jones · Answer

Hi Davy,

Both Synplify Pro and XST (and probably other synthesis tools) can do this to some extent.

It's also often called register re-timing. It works like this: you can write your big block of combinatorial logic in HDL, then add N registers after it (also in HDL), then get the tools to push the registers around to optimize the timing of the circuit.

That's the theory! In practice you'd be somewhat foolish to rely on this today (try it!). However, it's likely to become more and more important in future.

An area in which XST does this fairly well is multipliers. If you use the right settings and/or attributes, it's possible to write a combinatorial multiply followed by a register-based delay line, and have the registers pushed back into the adder tree automatically.

The original HDL source is untouched; it's the resulting netlist which is optimized.

Cheers,

-Ben-
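To make the re-timing idea concrete, here is a small cycle-level model in Python. It is only a sketch: it is not HDL and says nothing about how XST or Synplify Pro work internally. It compares a combinatorial multiply followed by a two-register delay line with a hand-retimed version in which the registers sit inside a shift-and-add tree. The function names and the 4-bit width of `b` are assumptions made for the illustration.

```python
def delay_line_multiply(pairs, stages=2):
    """Combinatorial multiply followed by a `stages`-deep register delay line."""
    regs = [None] * stages              # None marks a register that holds no valid data yet
    out = []
    for a, b in pairs:
        out.append(regs[-1])            # output is the last register, seen before the clock edge
        regs = [a * b] + regs[:-1]      # clock edge: shift the delay line by one
    return out


def retimed_multiply(pairs):
    """Same function and latency, with the two registers moved into the adder tree.

    Assumes b fits in 4 bits, so there are exactly four partial products.
    """
    stage1 = None                       # register pair holding two partial sums
    stage2 = None                       # register holding the final product
    out = []
    for a, b in pairs:
        out.append(stage2)
        # Compute both next-state values from the current register contents
        # before updating anything, as happens on a real clock edge.
        next_stage2 = None if stage1 is None else stage1[0] + stage1[1]
        partial = [(a << i) if (b >> i) & 1 else 0 for i in range(4)]
        next_stage1 = (partial[0] + partial[1], partial[2] + partial[3])
        stage1, stage2 = next_stage1, next_stage2
    return out
```

Both models produce the same output sequence, with each product appearing two clocks after its operands. Only the position of the registers relative to the logic differs, which is why re-timing can be done on the netlist without touching the source.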

Davy · Answer

Hi,

Yes, I know re-timing. It just pushes or pulls the existing registers (relying on the original netlist), but it does not insert registers. Is there any tool to insert registers?

Thanks!

Davy

Ben Jones · Answer

Well, inserting registers changes your design in a fundamental way. Most circuits I can think of would just stop working if you added registers to them at random. Only you, the designer, know exactly how much pipelining it is legal to apply to a given part of your circuit. So I don't believe such a tool exists - certainly not in the general case.

Cheers,

-Ben-

David Brown · Answer

I would think that would be a very bad idea to try to do automatically - it would completely change your timing. It's one thing to automatically do re-timing to improve your margins or your maximum clock rate, but adding registers will change the function of your logic. You might just as well ask for a tool to insert extra logic to improve your design.

John_H · Answer

You have to insert your own registers to make the pipeline a desired latency. You can then let the tool move the logic across those boundaries.

How would you specify to the tool what you want pipelined, what you don't, and what the expected final latency in clocks is? You insert registers in the paths you want piped.

The tool I use to insert registers: vi.

c d saunter · Answer

: > Yes I know re-timing, it just push pull the register(rely on the
: > original netlist), but not insert register.
: > Is there any tool to insert registers?

: Well, inserting registers changes your design in a fundamental way.
: Most circuits I can think of would just stop working if you added
: registers to them at random. Only you, the designer, know exactly
: how much pipelining it is legal to apply to a given part of your
: circuit. So I don't believe such a tool exists - certainly not in the
: general case.

This is very true, but there's no reason a designer couldn't specify a bunch of signals (e.g. the data signal from a combinatorial multiply and associated control signals) and have some tool add an arbitrary number (up to a user-specified limit) of pipeline stages to all of those signals to meet timing, with logic/register shuffling. This would only work if the control and data flows can be arbitrarily pipelined, but many ops can be described this way.

A halfway house to achieve this is to use current...
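As a rough illustration of the kind of tool described above (no such tool is claimed to exist), the selection step could be modelled like this in Python. The sketch assumes, optimistically, that re-timing can split the logic into equal slices. The function names and the nanosecond figures are hypothetical.

```python
def stages_to_meet_timing(logic_delay_ns, clock_period_ns, max_stages):
    """Smallest number of pipeline stages that meets timing, or None if the limit is too low.

    Assumption: re-timing balances the logic perfectly, so each of the
    (stages + 1) slices has delay logic_delay_ns / (stages + 1).
    """
    for stages in range(max_stages + 1):
        if logic_delay_ns / (stages + 1) <= clock_period_ns:
            return stages
    return None


def pipeline_signals(signal_names, stages):
    """Give every signal in the group the same delay so data and control stay aligned."""
    return {name: stages for name in signal_names}
```

For example, `stages_to_meet_timing(18.0, 5.0, 8)` returns 3, and the same count would then be applied to the data signal and its control signals alike. With a limit of 2 stages the function returns `None`, meaning timing cannot be met within the user's limit.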

Ben Jones · Answer

True, although I don't see much merit in doing it that way. In FPGAs, the pipeline registers are essentially free (because they're there after every LUT, even if you don't use them). So you don't get much advantage from "just" meeting timing - if you have four clock cycles to do something in, then you might as well take all four - who cares? You'll get better results out of the tools that way, too.

Of course, if you *do* care about the latency of your operations and you want to minimize it, then you're already thinking in enough depth about your design that an automated tool would be unnecessary.

Cheers,

-Ben-

P.S. Whenever someone says "automated tool" I immediately envisage a smug paperclip: "I see you're trying to close timing - would you like some help with that?" This of course means that all my subsequent utterances on the subject can be safely disregarded. :-)

Peter Sommerfeld · Answer

Hi Davy,

No tool that I know of, but you can write code in such a way that the pipelining is configurable. I've written a few blocks where I could adjust the pipelining of the block by changing a "pipelining schedule", which was just an array variable containing pipeline-able points in the design. By changing this variable, I could change the amount of pipelining, and therefore the number of registers used and the fmax of the design. This has worked pretty well for me for designs like binary trees of arbitrary depth.

At the top of the code I would have a variable like:

  pipeline_schedule(TREE_DEPTH-1 downto 0) := ( 0, 1, 1, 0, 1, 1 );

and further in the code, where I would have, say, a tree, I would do something like (this is not the actual code, just the idea here):

  for i in 1 to TREE_DEPTH-1 generate
    for j in 0 to LEAVES-1 generate
      if (pipeline_schedule(i)==0) generate
        -- just a level of logic
        a(i)(j*2)
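The effect of such a schedule can be explored with a few lines of Python before committing to HDL. This is a sketch, not Peter's code: it assumes that entry `i` of the schedule is 1 when tree level `i` is followed by a register and 0 when it is plain logic, and the function name `schedule_stats` is made up for the example.

```python
def schedule_stats(schedule):
    """Return (latency_in_clocks, worst_case_logic_levels_between_registers)."""
    latency = sum(schedule)          # one clock of latency per registered level
    worst = run = 0
    for registered in schedule:
        run += 1                     # this tree level adds one level of logic
        worst = max(worst, run)
        if registered:
            run = 0                  # a register here ends the combinatorial run
    return latency, worst
```

Applied to the schedule `[0, 1, 1, 0, 1, 1]`, this gives a latency of 4 clocks with at most 2 levels of logic between registers. An all-zero schedule gives zero latency and a combinatorial path through every level, which is the latency/fmax trade-off the schedule is meant to expose.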

John McCluskey · Answer

I've tried to write my code this way for some time now. This is a design style which is pretty hard. You start with an algorithm or computation, and apply it to an arbitrary sized chunk of data, and build the circuit so that registers are inserted at "appropriate" points during elaboration of the structure.

It would be easier if the VHDL language standard was modified to support "return generic values" that are computed during component elaboration and returned as a compile time constant to the upper level code that instantiates the component. The reason for this, of course, is that the lower level component should contain code to calculate the appropriate latency, and then return this value to the upper level of the hierarchy so that the other signal paths can have their latency balanced.

Since VHDL doesn't let you do this, the only other solution I can think of is to write functions in a package that perform the latency calculation at the top level, and then pass the latency as a...
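The package-function workaround can be pictured with a Python stand-in. In VHDL the equivalent would be pure functions evaluated at elaboration. This sketch assumes a binary adder tree registered every few levels, and both function names are hypothetical. The point is that one shared function computes the component's latency, and the top level uses the result to pad the other paths.

```python
def adder_tree_latency(num_inputs, levels_per_register):
    """Latency in clocks of a binary adder tree registered every `levels_per_register` levels."""
    depth = (num_inputs - 1).bit_length()   # ceil(log2(num_inputs)) for num_inputs >= 1
    return depth // levels_per_register


def balancing_delays(path_latencies):
    """Extra register stages each parallel path needs to match the slowest path."""
    longest = max(path_latencies.values())
    return {name: longest - latency for name, latency in path_latencies.items()}
```

For instance, a 16-input tree registered every 2 levels has a latency of 2 clocks. A `valid` flag that bypasses the tree with no registers would then need 2 delay stages to stay aligned with the data.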

Ben Jones · Answer

Hi John,

Yup, that gets said about once a week in this office. :-)

Well, the other possibility is to use flow control handshakes at each pipeline stage and put up with non-deterministic latency. That has its own problems, of course - but I've found both approaches useful in certain contexts.

As another poster indicated, if you have a bunch of sites where a register could be placed, then it's a piece of cake to have an array of booleans as a generic parameter to control the register placement. The problem of finding optimal register placement can usually be solved by brute force: write a script to implement the component using every possible vector with N bits set, for every value of N you're interested in. In fact, you don't even need to do every possible vector (which is a binomial-type thing) because you can easily prove that certain vectors will always be worse than other vectors. This narrows the search space quite a bit (which matters if you're talking about a 10-deep pipeline, but not so much if...
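A minimal sketch of the brute-force search, in Python: it enumerates every boolean vector with exactly N bits set and keeps the cheapest. The cost function here is only a stand-in (the longest run of logic levels between registers). A real script would instead run synthesis and place-and-route for each vector and read back the achieved fmax. All names are assumptions, and no pruning of provably worse vectors is attempted.

```python
from itertools import combinations


def candidate_vectors(num_sites, num_registers):
    """Yield every placement vector with exactly `num_registers` bits set."""
    for chosen in combinations(range(num_sites), num_registers):
        yield tuple(site in chosen for site in range(num_sites))


def worst_segment(vector):
    """Stand-in cost: longest run of logic levels not broken by a register."""
    worst = run = 0
    for registered in vector:
        run += 1
        worst = max(worst, run)
        if registered:
            run = 0
    return worst


def best_placement(num_sites, num_registers, cost=worst_segment):
    """Return the first vector with minimal cost among all candidates."""
    return min(candidate_vectors(num_sites, num_registers), key=cost)
```

With 6 sites and 2 registers there are 15 candidates, and the stand-in cost picks the placement that splits the logic into three two-level segments. Swapping `cost` for a function that launches the real tool flow turns this into the script described above.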
