Virtex 4 FIFO16 blocks - Corruption ?

Hi,

We're faced with a strange problem ... While investigating a bug in one design, we found a behavior that we could only observe on the real board, never in simulation.

Using ChipScope, we finally traced the problem down by monitoring both the write and read ports of a FIFO16 configured as 18x1024, using the same rd/wr clocks. That FIFO was used in a "weird" way, with the ALMOSTFULL threshold set very high (but still within spec), so that it turns on very quickly. What we observed: we push a word with some parity bits set (which are not 'true' parity but some critical control bits), we continue to push, ALMOSTFULL goes up (normal), and we keep pushing (we still have plenty of room) while at the same time re-reading, but more slowly (not on every clock cycle). When we finally read back the word where the parity bit was set, the data bits (15:0) are there but the parity bit is not, it's just 0 ...

The ChipScope probes were tied directly to the FIFO signals, no logic in between. That FIFO is supposed to cross clock domains, but for debugging we just sent the same clock everywhere. And the behavior of the surrounding logic is consistent with that bit being missed.

Instead of setting ALMOSTFULL to a very high value, we used NOT ALMOSTEMPTY (since we're debugging with just one clock domain, that's OK here), and with that we never seem to observe such a miss.
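To make the scenario concrete, here is a minimal software model of what an 18x1024 FIFO should do in this test. This is a purely illustrative Python sketch: the class name and threshold value are made up, and it models ideal behavior, not the FIFO16 silicon (which is exactly what deviated in our case).

```python
from collections import deque

class Fifo18x1024:
    """Behavioral reference model of an 18-bit x 1024-deep FIFO.

    Software model of what the hardware *should* do; the observed
    silicon behavior (dropped parity bits) deviates from this.
    """
    DEPTH = 1024

    def __init__(self, almost_full_threshold):
        self.mem = deque()
        self.af_threshold = almost_full_threshold

    def write(self, data16, parity2):
        assert len(self.mem) < self.DEPTH, "overflow"
        # Store the 16 data bits and 2 parity/control bits as one 18-bit word.
        self.mem.append((data16 & 0xFFFF) | ((parity2 & 0x3) << 16))

    def read(self):
        word = self.mem.popleft()
        return word & 0xFFFF, (word >> 16) & 0x3

    @property
    def almost_full(self):
        return len(self.mem) >= self.af_threshold

# Push a word carrying control bits, keep pushing past the ALMOSTFULL
# threshold while reading back more slowly, then check the bits survive.
fifo = Fifo18x1024(almost_full_threshold=8)
fifo.write(0x1234, 0b11)          # the word with the control/parity bits set
for i in range(20):
    fifo.write(i, 0)
data, parity = fifo.read()
assert (data, parity) == (0x1234, 0b11)   # ideal FIFO: parity bits come back
```

In the failing hardware case, the equivalent of the final assertion is what breaks: `data` reads back correctly but the upper (parity) bits read as 0.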

Has someone ever observed such a behavior ?

Sylvain

Reply to
Sylvain Munaut

Have you got any resolution on this? Have you opened a case with Xilinx? What does Xilinx have to say about it?

I am aware that some people have had problems with the FIFO16 not working correctly. I had an issue with trying to use the FIFO as a synchronous fifo (it is async, so there is a possibility with some ambiguity on the flag latency when both clocks are the same). I have asked Xilinx repeatedly to document this behavior prominently in the user guide, but so far they have only quietly acknowledged that the user has to be careful if read and write clocks are the same.

That said, your problem is different than the one I experienced and appears to be a more serious problem in the FIFO16 logic. You are not the first person I've heard state they had problems with the fifo16 async behavior. There may be some issues with the flag logic for asynchronous use as well.

I do find it interesting that Altera was forthcoming with their recent problems with dual port memories. I hope that Xilinx is equally forthcoming if there is indeed a problem with the FIFO16 logic.

Reply to
Ray Andraka

Ray,

The bug for use of the async FIFO synchronously has been acknowledged, and we apologize for not getting it out there more prominently. But:

In our defense, it is unusual (or at least, so far we think it is unusual) for the read and write clocks to be tied directly together (why use a FIFO at all? I guess it is a really useful structure, so even when used this way it is too useful to ignore ...?).

The solution is not to source the two clocks directly from the same source, but to place a small delay in one or the other.

The problem does not exist in the asynchronous case, as it takes two subsequent clock cycles on BOTH clocks (at exactly the wrong times) to cause the problem. As long as two adjacent clock cycles don't arrive on both clocks at exactly the same instant, just as you are getting full (or is it empty? I'm not the expert on this), it works fine.

Sometimes with problems like this (that are difficult to even cause) it doesn't make sense to put up a billboard that it is an issue, as then everyone comes down with the disease (mass hypochondria) when they don't really have the problem.

Now, if the feature is just plain broke, then it is a different story, and we will end the pain as soon as we are sure it is just plain broke.

No one is intentionally hiding anything, but we are judiciously placing (obscure) bug information only with the hotline and support community, rather than broadcasting it across the entire user community publicly.

If, for any reason, you feel that you have caught the disease (have a bug we haven't shared universally), the entry of a webcase will get you the help you need, as the hotline will search for all such issues. If yours is there, then we will immediately share with you the solution.

These are known as "internal answers" and it isn't that we don't want to share them, we just don't think they are likely issues for everyone. Better to talk to you and find out what the problem is, first.

If these internal answers are made external, we imagine there would be thousands of designers running down debug paths that are so obscure, there is almost no chance they will find this as their problem. Then we get a bad reputation, and the hotline is overwhelmed with folks who all think they have this obscure problem!

I hope folks will appreciate that sometimes telling every strange and obscure story causes more trouble than selectively understanding each issue that arises, and dealing with it directly.

Support: it is an art.

Austin


Reply to
Austin Lesea

Austin,

You are kidding as far as the usefulness of a synchronous FIFO (one which has both sides clocked by the same clock), right? This is a rather common structure in pipelined designs; it is an elastic buffer, useful, for example, for processing bursty data at a more relaxed rate than the data is presented. I'd be hard pressed to find one of my designs that does NOT have a synchronous FIFO in it. The solution with the "small" delay is fine if you are not pushing the performance envelope, but it will destroy timing closure in designs that are. For example, I have a floating point FFT design with a target clock rate of 400 MHz in an SX55-10 part, basically running at the DSP48/memory speed. It has synchronous FIFOs in it, and there is no room in the timing for adding small delays to clocks.

This is a real limitation of the FIFO16 design, and it has cost me several weeks of debug and redesign time to find and work around. It should be prominently highlighted in the user guide in the section that describes the use of the FIFO16; I am sure other users are going to encounter the same issue. No one looks at the answers database until they have a problem and have identified the source of the problem. The synchronous FIFO issue could easily be considered a limitation rather than an outright bug, but it does have to be made clear to the user before he does the design, not when he is trying to figure out why it isn't working. By keeping it close to your chest as an internal answer, I suspect you'll wind up generating a heck of a lot more hotline cases than if you put it in black and white, right in the user's guide, that this is the way the FIFO16s work and that these are the things you need to do to work around the limitation if the clocks are the same on both sides. BTW, I don't think this is an "obscure" issue either, as anyone attempting to use the FIFO16 as a synchronous FIFO is going to encounter it.

The flip answers regarding the synchronous FIFO (things like "such a structure is not useful" and "just add delays to the clock" when I've explained that that is not a viable solution for maximum-performance designs), combined with the reluctance to make it clear to users that this is a limitation of the FIFO16 design, make it appear that either Xilinx doesn't understand the issue or is trying to sweep it under the rug. I presume and hope it is the former, although neither is a particularly good outcome.

I am reluctant to enter a webcase on an issue such as this unless it has become critical for the project. Invariably, the result of entering a webcase is my having to generate and submit testcases to prove the problem, and often having to come up with my own work-around because the fix won't be available until the next major release. Nobody pays me for the time spent doing testcases to ferret out the source of a bug in the software or silicon. There have been months recently where I've spent more than a quarter of my time identifying and generating test cases for problems in the tools (not just Xilinx). Naturally, I'd like to avoid that as much as practical.

Regarding the asynchronous FIFO behavior, I don't have any direct experience with the FIFO16 behaving badly as an async FIFO, but I haven't used it in that mode in a design that has made it to testing. Sylvain's description does sound as though the FIFO may be misbehaving, and it jibes with things I've heard from others. This is why I asked him whether he had opened a case with Xilinx and what the resolution of that case was. It is important to know if there is a potential problem so that I can avoid it during the design rather than discover it during integration. I am currently working on a design that has several async FIFO16s in it, and would like to believe that they will work for me; however, these rumblings have me concerned, hence my asking Sylvain about his resolution. So far, the workarounds I am aware of have used the coregen FIFO instead of the FIFO16, which does not have the same clock performance as the FIFO16.

I didn't intend to kick over the beehive here, I was only trying to collect more data so that I might avoid a problem in my own design if it does exist.


Reply to
Ray Andraka

My colleague had some contact with our distributor but afaik, no news yet.

Looking at the Xilinx answer records, I saw that FIFOs built from FIFO16 blocks generated with an old version of the FIFO Generator could show some data corruption problems, and that use of the new version is recommended ... but I didn't use CORE Generator, I instantiated FIFO16 directly (coregen doesn't have first-word fall-through anyway ...).

I haven't opened a webcase myself yet ... often, before "bothering" Xilinx people, I want to be sure ;p I've tried to reproduce the problem with a far simpler design, but so far no luck ... (even in the full design it's quite "rare", but one occurrence is enough to lock it up ...).

What exactly is the problem if the clocks are the same? (What behavior can occur?)

Well, here we use the FIFOs synchronously ... They are meant to be used asynchronously in the future, but for testing we've put everything on the same clock. Other parts of the design will always use them synchronously, though, so I must get it working in both modes ...

Sylvain

Reply to
Sylvain Munaut

Where can I get detailed information about it? (To be sure I don't run into it, or at least that it doesn't cause trouble in my design?)

Sylvain

Reply to
Sylvain Munaut

Sylvain,

I expect the fastest way is to open a webcase requesting the information.

As I already stated, if both read and write clocks come from the same BUFG net, then this may (and probably will) be an issue at some process/voltage/temperature corner (hence the insidiousness of the issue).

A quick fix is to drive one of the clocks from the other edge (one rising, one falling) which may require another BUFG resource (in order to be sure the delay doesn't put you right back where you started).

It is my understanding that a macro will be created to instantiate the sync FIFO with the required offset delay automatically in the best way we can (probably using fabric resources, like a LUT, doubles, hexes, etc.).

The issue, as I was told, is that at the critical instant the almost full/almost empty flag assertion will be correct, but if the event occurs again on the very next clock cycle, the flag will reset to 0, which is not correct (the FIFO is still almost full, or almost empty, if nothing was read out or written in on that cycle).
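Put another way, a correct ALMOSTFULL/ALMOSTEMPTY flag is purely a function of occupancy, so it cannot deassert while the level stays past the threshold. A small Python sketch of the correct semantics (illustrative only; the function and threshold are invented, and this is not a model of the actual FIFO16 flag circuit):

```python
def almost_full(occupancy, threshold):
    """Level-based ALMOSTFULL semantics: a pure function of occupancy.

    Because it depends only on the current level, it cannot deassert
    while the FIFO stays at or above the threshold.
    """
    return occupancy >= threshold

THRESHOLD = 10
occ = THRESHOLD            # sitting right at the critical instant
flags = []
for cycle in range(2):     # the "event occurs again on the very next cycle"
    occ += 1               # a write with no read: level only rises
    flags.append(almost_full(occ, THRESHOLD))

# Correct behavior: the flag stays asserted on both cycles.
# Per the report above, the silicon instead drops it on the second cycle.
assert flags == [True, True]
```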

There may be other simpler solutions (that we haven't thought of yet).

Again, the jury is out on the async case....


Reply to
Austin Lesea

This is exactly what I mean by the problem being hidden. I searched the answers database for FIFO16 and did not turn up anything regarding the known synchronous behavior problem, nor any async problems. It may still only be in the internal database, if it is even there. In debugging stuff like this, I've always assumed the silicon is good and that any problems are a result of the design until I can prove otherwise. As a result, you don't suspect the FIFO itself as being the problem. That can lead to a tremendous amount of debugging effort before finding out there is a problem or unpublished limitation with the silicon. Considering how much time I spent fiddling with this problem, I suspect there are literally thousands of man-hours put into debugging the same problem in different projects, simply because Xilinx doesn't want to advertise a limitation of their design.

The problem with synchronous usage is that the flag circuit is an asynchronous design. When the same clock drives both sides and a read and a write are done on the same clock cycle, the flag circuit shows one clock of jitter in the timing of the flag outputs, such that the word written in at the same time the last one is read out may or may not make the FIFO show empty. If EMPTY does get set, it then takes something like three clocks to go away, so you wind up with non-deterministic behavior. It is an artifact of using an async flag circuit.
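The correct synchronous behavior is easy to state: a simultaneous read and write leaves the occupancy unchanged, so EMPTY should never glitch in that case. A minimal Python model of that invariant (illustrative only; the class and method names are made up, and this models ideal behavior, not the FIFO16's flag circuit):

```python
class SyncFifo:
    """Minimal single-clock FIFO model: a simultaneous read and write in
    the same cycle leaves the occupancy (and hence EMPTY) unchanged."""

    def __init__(self):
        self.data = []

    def cycle(self, write=None, read=False):
        """Simulate one clock cycle; returns the word read out, if any."""
        out = None
        if read and self.data:
            out = self.data.pop(0)   # oldest word comes out first
        if write is not None:
            self.data.append(write)
        return out

    @property
    def empty(self):
        return not self.data

f = SyncFifo()
f.cycle(write=1)                    # occupancy 1, EMPTY deasserts
out = f.cycle(write=2, read=True)   # read the last word, write a new one
assert out == 1 and not f.empty     # deterministic: still one word, never EMPTY
```

In the silicon behavior described above, that final `not f.empty` condition is exactly what becomes non-deterministic.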

BTW, finding stuff in the answers database is a lot like finding a needle in a haystack, provided you even know what you are looking for.

Reply to
Ray Andraka

Ray -

I've got to agree with you that finding stuff in the answers database is hit-or-miss at best. I use Xilinx parts in my designs and I like the parts, but the support from the web site leaves a lot to be desired. Thank goodness Peter and Austin pay attention to this group!

I often think that web masters should be forced to sit with users for a while so they end up understanding how slow and poor the user experience is.

My most recent frustration was trying to find information on a DCM bug requiring the bitgen centered option. Good luck finding info on it.

John Providenza.

Reply to
johnp
