2

I try to use Altera's floating point IP to generate half precision instead of single (32-bit) blocks for addition , multiplication etc. However when configuring the IP it seems that half precision fp needs a lot more LUTs and has far more latency in cycles than the 32-bit counterpart. This seems contrary to the assumption that half precision fp should be faster than single precision and occupy less circuit area....has anybody used altera IP half precision fp blocks? Fid you see any improvement compared to single precision?

Mirza Sisic
  • 2,401
  • 4
  • 24
  • 38
user2609910
  • 157
  • 8
  • Larger latency, but what about the throughput ? More LUTs isn't unexpected: with only 16 bits you have the opportunity to use them (at most 65536 entries), while for 32 bits, this is unthinkable. –  Apr 25 '16 at 09:09

0 Answers0