DDR Mux - how does it work?

Question

I'm looking at the IOB diagram for Spartan-3. The output path has a block labeled "DDR MUX" that seems like it should do the obvious thing.

It's got two inputs - the data bits.

How does it know when to switch? Does it get both clocks too, through some path that isn't shown?


Gabor Szakacs · Accepted Answer

Try the Virtex II datasheet. It shows a block diagram where even the two data bits don't connect to the "DDR MUX", so it works with no inputs!

If you open the design in the FPGA editor you see something similar, and when you hold your mouse over the mux the popup description includes the words "black box".

I think Xilinx is trying to simplify the diagram to reduce clutter. The mechanism must have the clocks because it has to switch on the rising edge of each input clock (after the programmable inversion). If I had to code this in Verilog it would look like:

always @ (posedge FF1CLK) muxout
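
To make the idea concrete, here is a minimal behavioural sketch in Python (not HDL, and not Xilinx's actual circuit). It assumes the two clocks are opposite phases of the same clock, so their rising edges alternate, and that one D0/D1 pair is presented per clock period. The function name `ddr_output` and the list-based interface are made up for illustration.

```python
def ddr_output(d0_words, d1_words):
    """Return the pad value after each clock edge.

    Assumption: the rising edges alternate C0, C1, C0, C1, ...
    and one (d0, d1) pair is presented per clock period.
    """
    pad = []
    for d0, d1 in zip(d0_words, d1_words):
        pad.append(d0)  # rising edge of C0: output follows the D0 flop
        pad.append(d1)  # rising edge of C1: output follows the D1 flop
    return pad


print(ddr_output([1, 0, 1], [1, 1, 0]))  # [1, 1, 0, 1, 1, 0]
```

This only describes what the output should look like. It says nothing about how the selection is done in silicon without glitches.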

newman · Answer

The component template gives some insight on the connections.

I used it to forward a clock in a Virtex-II, and used the DCM phase adjust to optimize the timing.

I was excited about using this for clock forwarding. Others did not share my excitement. I must be a geek. Anybody got an extra Virtex-4 Xilinx T-Shirt they are willing to part with. I don't mind if it is wrinkled :}

- Newman

  component FDDRCPE
    port(
      -- INPUTS --
      C0  : in std_logic;
      C1  : in std_logic;
      CE  : in std_logic;
      CLR : in std_logic;
      D0  : in std_logic;
      D1  : in std_logic;
      PRE : in std_logic;
      -- OUTPUTS
      Q   : out std_logic
    );
  end component FDDRCPE;  -- FDDRCPE

Newman5382 · Answer

Just as a follow-up: the original question is a good one. When one sees a 2:1 mux, one may wonder whether it is susceptible to static-1 and static-0 hazard glitches. I've seen that Xilinx recommends using this component for clock forwarding, which implies that it is not subject to them. When I reviewed the design in the FPGA editor, the inverter in the C1 path had apparently been absorbed into the IOB. Some app note for high-speed designs recommended using the 180-degree DCM output for C1. Since my design was not high speed, and I was running out of clock buffers on that side of the chip, I opted for the merged inverter. Apparently the "DDR Mux" uses two clocks, but it is unclear exactly how it does that.

- Newman

Gabor Szakacs · Answer

It's clear that the "mux" is really an integral part of the DDR flop design. It has been suggested that the individual flip-flops are not really wired as shown, but rather wired so that the outputs can be XORed together to form the final Q output. This method does not have a glitch if the Q doesn't switch. See VHDL code below.

The only reason I can see to use 180 DCM output (which uses extra global routing resources) is if the clock you're using is not 50% duty cycle. Normally DCM outputs are adjusted to 50% unless you tell the tools specifically not to. Thus using 0 phase clock and its inverse (yes the inverter is pulled into the IOB) gives just as good results. I think newer reference designs use this method instead of the 0 and 180 method.

I have used DDR flops as clock drivers because they make a good low skew (but not zero delay) copy of a global clock signal without extra routing resources related to using multiple DCM phases or 2x clocks. They also neatly match the data delay in DDR...
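
The VHDL mentioned in this answer is not reproduced in the thread. As a stand-in, here is a minimal behavioural sketch in Python of one common way to build the XOR arrangement. It assumes each flop's D input is the new data bit XORed with the other flop's current output. That feedback wiring is an assumption for illustration, not something the post or the Xilinx documentation confirms, and the name `xor_ddr` is hypothetical.

```python
def xor_ddr(d0_words, d1_words):
    """Behavioural model of a two-flop DDR output with pad = q0 XOR q1.

    Assumptions (not confirmed by the thread): rising edges alternate
    C0, C1, C0, C1, ...; each flop samples its data bit XORed with the
    other flop's output; both flops start at 0.
    """
    q0 = q1 = 0
    pad = []
    for d0, d1 in zip(d0_words, d1_words):
        q0 = d0 ^ q1          # rising edge of C0: only q0 is clocked
        pad.append(q0 ^ q1)   # equals d0
        q1 = d1 ^ q0          # rising edge of C1: only q1 is clocked
        pad.append(q0 ^ q1)   # equals d1
    return pad


print(xor_ddr([1, 0, 1], [1, 1, 0]))  # [1, 1, 0, 1, 1, 0]
print(xor_ddr([1, 1, 1], [0, 0, 0]))  # [1, 0, 1, 0, 1, 0]
```

In this model each edge clocks exactly one flop, and that flop changes state only when the pad value has to change, which is consistent with the claim that the output does not glitch when Q stays the same. The second call is the clock-forwarding case with D0 tied high and D1 tied low.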

Hal Murray · Answer

I started thinking along that line. I couldn't make it work.

Consider the case where the FFs are fed by 1/0 constants to make a clock. The value of the FFs never changes.
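
A small Python sketch of that failure case, assuming the two flops capture D0 and D1 directly with no feedback and the pad is simply their XOR (the function name `direct_xor` is made up for illustration):

```python
def direct_xor(d0_words, d1_words):
    """Two flops that capture D0 and D1 directly, with pad = q0 XOR q1.

    Assumptions: rising edges alternate C0, C1, C0, C1, ...
    and both flops start at 0.
    """
    q0 = q1 = 0
    pad = []
    for d0, d1 in zip(d0_words, d1_words):
        q0 = d0               # rising edge of C0
        pad.append(q0 ^ q1)
        q1 = d1               # rising edge of C1
        pad.append(q0 ^ q1)
    return pad


# Clock forwarding: D0 tied to 1, D1 tied to 0.
print(direct_xor([1, 1, 1], [0, 0, 0]))  # [1, 1, 1, 1, 1, 1]
```

A forwarded clock would need the pad to alternate 1, 0, 1, 0, ... but with constant inputs neither flop ever changes after the first edge, so the XOR of the two stays stuck at 1.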

I'm sure the actual implementation has some interesting magic.

At least one place in the Xilinx documentation suggests using clk and clk180 in preference to clk and clk-inverted for better results. I think the idea was that the skew between rising and falling edges is larger than the skew between loaded and not-loaded clock distribution nets.

Might be fun to measure. I wonder how hard it would be to make something that balanced the clock loading. Probably doable in FPGA editor by hand for a one-shot experiment.

You could use a DLL with external feedback if you want to avoid that delay. But that delay doesn't matter for the normal clock forwarding case.
