How the synthesizer actually works.

Question

Hi guys,

To learn how the synthesizer behaves, I wrote logic to add 4 vectors in three different ways, and I got different results from the synthesizer (I used both ISE and Synplify). These are the three different approaches I tried:

1.
***************************************************************************************************

    module add(
        input clk,
        input [1:0] a,b,
        input d,
        input [63:0] c,
        output reg [64:0] out
        );
    reg [1:0] in1,in2;
    reg in4;
    reg [63:0] in3;
    wire [64:0] result;
    always @ (posedge clk) begin
        {in1,in2,in3,in4}= {a,b,c,d};
        out= result;
    end
    assign result= in1+in2+in3+in4;
    endmodule

2.
***************************************************************************************************

    module add1(
        input clk,
        input [1:0] a,b,
        input d,
        input [63:0] c,
        output reg [64:0] out
        );
    reg [1:0] in1,in2;
    reg in4;
    reg [63:0] in3;
    wire [64:0] temp;
    wire [64:0] temp2;
    wire [64:0] result;
    always @ (posedge clk) begin
        {in1,in2,in3,in4}= {a,b,c,d};
        out= result;
    end
    assign temp= in1+in2;
    assign temp2= temp+in4;
    assign result= temp2+in3;
    endmodule

3.
***************************************************************************************************

    module add2(
        input clk,
        input d,
        input [1:0] a,b,
        input [63:0] c,
        output reg [64:0] out
        );
    reg [1:0] in1,in2;
    reg in4;
    reg [63:0] in3;
    reg [64:0] result;
    always @ (posedge clk) begin
        {in1,in2,in3,in4}= {a,b,c,d};
        out= result;
    end
    always @ (*) begin
    case({in4,in1,in2})
        5'b00000: result= in3+3'b000;
        5'b00001: result= in3+3'b001;
        5'b00010: result= in3+3'b010;
        5'b00011: result= in3+3'b011;
        5'b00100: result= in3+3'b001;
        5'b00101: result= in3+3'b010;
        5'b00110: result= in3+3'b011;
        5'b00111: result= in3+3'b100;
        5'b01000: result= in3+3'b010;
        5'b01001: result= in3+3'b011;
        5'b01010: result= in3+3'b100;
        5'b01011: result= in3+3'b101;
        5'b01100: result= in3+3'b011;
        5'b01101: result= in3+3'b100;
        5'b01110: result= in3+3'b101;
        5'b01111: result= in3+3'b110;
        5'b10000: result= in3+3'b001;
        5'b10001: result= in3+3'b010;
        5'b10010: result= in3+3'b011;
        5'b10011: result=...

John_H · Accepted Answer

Top posting to avoid the 4 pages of original post, included at the end...

The synthesizers appear generally not to be smart enough to group the additions by size first. We can't ask them to do all the work, but it would have been nice to get better results without the extra nudge. The nudge? In Synplify it's called a syn_keep, and there's a similar attribute in XST (though I know not what it's called).

Specific knowledge of the hardware can be used to get the "best" results. Since we know 4-input LUTs are available in the Virtex-4 (larger in the Virtex-5), it would be "most" efficient to add in1 and in2 with LUTs, then add that result to in3 and have in4 be a carry-in to this last add.

While "temp" values are great for readability, there's no guaranteed behavior when those values are combinatorial with no directives attached; the synthesizer will look at the overall combinatorial "logic cone" feeding the output reg.

To get your "best" result, use a 3-bit temp variable (* syn_keep=1 *)...
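The post above is cut off before its code, so what follows is only a minimal sketch of the grouping it describes, not John's actual listing. The module name `add_grouped` and the wire name `small_sum` are made up for illustration, and the registers use non-blocking assignments, which differs from the original modules. The `(* syn_keep = 1 *)` form assumes the tool accepts Verilog-2001 attribute syntax. Whether in4 really ends up on the adder's carry-in is up to the tool.

```verilog
// Sketch only: add the two narrow operands first, keep that net,
// then do a single wide addition with in3.
module add_grouped (
    input             clk,
    input      [1:0]  a, b,
    input             d,
    input      [63:0] c,
    output reg [64:0] out
);
    reg [1:0]  in1, in2;
    reg        in4;
    reg [63:0] in3;

    // 2 bits + 2 bits never exceeds 6, so 3 bits hold the sum.
    // syn_keep asks Synplify to preserve this net as a boundary.
    (* syn_keep = 1 *) wire [2:0] small_sum = in1 + in2;

    // One 64-bit carry chain; in4 is a candidate for the carry-in.
    wire [64:0] result = in3 + small_sum + in4;

    always @(posedge clk) begin
        // Input registers, as in the original modules.
        in1 <= a;
        in2 <= b;
        in3 <= c;
        in4 <= d;
        out <= result;
    end
endmodule
```

For XST, the comparable constraint is probably KEEP (it is mentioned later in this thread), but check the exact spelling and syntax in the XST documentation for your tool version.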

Newman · Answer

...

Newman · Answer

...

vssumesh · Answer

Subin, you're thinking out loud now. Keep it up, and never let it go; only this way can you learn new things. Anyway, about your doubt: we humans can see that what you're doing in each algorithm is the same, but the machines can't. Your first two cases are at least comparable, and we need to think about why they differed. As John suggested, it may be due to the blocking assignments. In the first case the synthesizer can view the expression as a single addition of four operands and pour all of its available optimizations into it. In the second case we are forcing the synthesizer to treat it as three separate add operations, and it may then be unable to apply all of its optimization techniques. As John suggested, try using non-blocking assignments; that may free the synthesizer to look at the problem as a single operation.

But the third one is simply a different thing. It's actually a decoder that decodes the {in4,in2,in1} variable, plus some logic to form the operand to add to in3, plus of course an adder to add that operand to in3. I think you will get the same result with the following code:

    case ({in4,in2,in1})
        op = value based on the different cases
        ...
    endcase
    result = in3 = op;
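Here is a minimal sketch of that decoder-plus-adder split, assuming the intended last line is an addition of in3 and op. The module name `add2_split` is hypothetical, and the select order {in4,in1,in2} is taken from the original add2. Only the first four table entries are written out; the `default` branch computes the same value the remaining entries of the original table encode (for example 5'b01111 gives 3'b110).

```verilog
// Sketch only: decode the small operand first, then use one wide adder.
module add2_split (
    input      [1:0]  in1, in2,
    input             in4,
    input      [63:0] in3,
    output     [64:0] result
);
    // in1 + in2 + in4 is at most 3 + 3 + 1 = 7, so 3 bits are enough.
    reg [2:0] op;

    // Decoder: map the 5 select bits to the small operand.
    always @(*) begin
        case ({in4, in1, in2})
            5'b00000: op = 3'b000;
            5'b00001: op = 3'b001;
            5'b00010: op = 3'b010;
            5'b00011: op = 3'b011;
            // Remaining entries follow the same pattern as the original table.
            default:  op = in1 + in2 + in4;
        endcase
    end

    // Single 64-bit adder shared by every case branch.
    assign result = in3 + op;
endmodule
```

Written this way there is one adder in the source rather than one per case branch, though a synthesizer may well arrive at the same structure from the original add2.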

Newman · Answer

result = in3 + op;

It looks like Synplicity thinks methods 1 and 2 are identical. I would think that a variation of method 2 should be able to get skinnied down a bit in the area of XORCY, LUT1, .... I was wondering if the map routine during the implementation phase would trim some of these out.

Interesting academic exercise. For practical purposes, I think John had it about right for how to group things together. It would be interesting to see what the resource usage of coregen components would be that were structurally connected together, as stated by John.

-Newman

vssumesh · Answer

And Subin, I didn't notice the last question: "What should be the real approach to writing HDL to get the most optimized result?"

How can one suggest a general method or guideline for coding? I think we can classify it into two separate classes:

1) General functionality such as addition, multiplication, muxing, etc. I think here we need to code them as directly as you did in the first code. All the synthesizers, I hope, will have algorithms to deal with that, so there is no point in creating something like the 2nd or 3rd coding style. That doesn't look good in the HDL itself.

2) Unconventional functionality, like what we implemented in the source formatin switching logic. We know what to do, but no machine can translate it directly into optimized hardware. So we think it through ourselves, find a way to implement it, and code it that way.

I think that when we are coding something, we need to differentiate between these two classes.

subint · Answer

Thanks for all of the replies.

John, you are completely right: it was the grouping of the adders that made the difference. But how? Why is out=a+b+c+d not equal to out = (a+b+c)+d;?

Yes, I intentionally added those input registers. This is the method I follow to generate the worst path using the Synplify tool. The blocking and non-blocking assignments are not making any difference in any of my 3 codes. But with the "syn_keep" you suggested in the second one, I am getting the "best" result in both Synplify and ISE. Changing the temp size to 3 by itself helped to generate the "best" result in ISE (without the KEEP  ...

subint · Answer

Hi sumesh,

The second method is giving me the "best" result. Grouping the small adders together and adding that to the bigger one actually reduces the hardware. But I am surprised at how it gets implemented without the grouping.

Regards,
sub

> and subin i didnt observed the last question......

vssumesh · Answer

After PAR are you also getting the same? Let me think. OK, put it this way:

64 bit + 1 bit -> needs one 64-bit carry propagation network.
Above result + 1 bit -> needs one more 64-bit carry propagation network.

And so on. But suppose:

1 bit + 1 bit -> needs only a two-bit carry propagation network.
64 bit + 2 bit -> needs a 63-bit carry propagation network.

So the second one is more efficient. I am neglecting all the small additions since they are all two- or 1-bit additions. Anyway, I did not appreciate the power of this grouping, nor did I carefully read JohnH's first reply. Sorry.

So I think that in a single-stretch addition the evaluation is from left to right. Your case (in1+in2+in3+in4) ==> ((2bit + 2bit) + 64bit) + 1bit needs two 64-bit carry chains. Try changing that order to in1 + in2 + in4 + in3; and also don't forget to test in3 + in4 + in2 + in1; I think that will give the maximum value.

Please test it and let me know. As you know, it's been more than two months since I last...
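The two orderings proposed above could be compared with a sketch like the one below. The module name `add_order` and both output names are hypothetical. Verilog's binary `+` is left-associative, so each expression parses as its comment shows. The two outputs always carry the same value; any difference would appear only in the resources the tool reports, and a tool that optimizes the whole logic cone may report none.

```verilog
// Sketch only: same sum, two source-level orderings.
module add_order (
    input  [1:0]  in1, in2,
    input  [63:0] in3,
    input         in4,
    output [64:0] small_first,
    output [64:0] wide_first
);
    // Parsed as ((in1 + in2) + in4) + in3: narrow operands combine first,
    // so only the final add involves the 64-bit operand.
    assign small_first = in1 + in2 + in4 + in3;

    // Parsed as ((in3 + in4) + in2) + in1: the 64-bit operand enters
    // at the first add, so every later add is also wide.
    assign wide_first  = in3 + in4 + in2 + in1;
endmodule
```

For a fair comparison, synthesize each assign in its own build (or put a keep attribute on each output). Otherwise the tool may notice the two sums are identical and share the logic.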

John_H · Answer

You appear to be responding to posts and not asking questions so the ability for others to read your message is less important, perhaps.

Do you want others to actually read what you say? I read through your previous post and had sincere trouble following along due to the abbreviations and lack of sentence/paragraph structure. Since you're not asking a question here, I'm just not reading the post.

If you don't care if your messages are read, you don't need to do anything at all. If you'd like to be part of the grand conversation, you will get more people to see what you're saying if you stick to a good written style. Scanning this message that I didn't read, it looks like there are fewer texting-style abbreviations. Great start. Avoid all the dots in your thoughts trailing off and instead use solid sentence structure and formatted paragraphs and your message will be inherently more readable.

I appreciate the interaction from most of the folks on this board (I only have one author on my kill list at this point - so much nicer that way) and would like to see the conversation open and not ignored.

I'm just making a recommendation here, no demands. Your posts today are simply the most difficult to read in the last several months.

Otherwise, thanks for the contributions.

John_H · Answer

The grouping was once (back in the early 90s, at least by some tools) specifically order-dependent. Since the language became a standard and more synthesizers got better optimizations, the order of operations and the implied grouping with parentheses no longer make the impact on synthesis one might hope in trying to optimize the code.

Since the synthesizers believe they can do a better job by looking at the entire logic cone, the synthesis results *should* be the same independent of order. The arithmetic elements are one example where it appears the synthesizer optimizations are "a little behind" where we'd want them to be. I *often* have syn_keep attributes around the adders in my code to make sure the proper "minimum" amount of logic goes into my adder and the register-to-register flow doesn't get broken up improperly.

Because the synthesis is based on the logic cone and not the way the equations are grouped, the use of parentheses or additional temp wires will often affect the...
