Hello,
I am a Python programmer writing neural network code with binary firing and binary weight values. My code will take many days to parse my large data sets. I have no idea how much fpga could help, what the cost would be, and how easy it would be to access it from Python. The problem is similar to competitive networks, where I must dot product many million-length bit vectors (which only change occasionally) with 1 input vector. Anybody want to estimate the cost, speedup, and value an fpga could offer me?
Seems like this problem shouldn't be so hard, but from the little research I've done I haven't found a good value product that is ready-made, so I'm looking at (multiple?) fpga as a coprocessor.