It may be possible in a single FPGA chip. I found a 15-million-gate device, but it is not inexpensive and there are no free tools for it. Maybe it's time to try to crowdfund my "hobby".

I suppose I could do a proof-of-concept version with only 4 lanes (1/16 the size of the full design) and, hopefully, make the number of lanes a parameter of the core.
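
To make the "lanes as a core parameter" idea concrete, here is a rough sketch in Python rather than HDL. It is only a model of the configuration, not the core itself. The names `CoreConfig`, `FULL_LANES` and `relative_size` are made up for illustration. The full lane count of 64 is inferred from 4 lanes being 1/16 the size, and the size estimate assumes that area scales linearly with the number of lanes, which ignores any shared logic.

```python
from dataclasses import dataclass
from fractions import Fraction

# Inferred from the post: 4 lanes is 1/16 of the full design, so 64 lanes.
FULL_LANES = 64


@dataclass(frozen=True)
class CoreConfig:
    """Hypothetical build-time configuration for the core."""

    lanes: int = FULL_LANES

    def __post_init__(self):
        # Reject lane counts outside the range the full design covers.
        if not 1 <= self.lanes <= FULL_LANES:
            raise ValueError(f"lanes must be between 1 and {FULL_LANES}")

    def relative_size(self) -> Fraction:
        """Size relative to the full design.

        Assumption: area scales linearly with lane count.
        """
        return Fraction(self.lanes, FULL_LANES)


poc = CoreConfig(lanes=4)
print(poc.relative_size())  # 1/16
```

In an actual HDL the same knob would more likely be a module parameter or generic that drives a generate loop over the lanes, so the 4-lane and full builds come from one source.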