Skip to content

Arith2/Recommend_inference_distributed

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RecEasyNet

In this project, a large recommendation model (three-layer forward neural network) is deployed in FPGA clusters. Communication among nodes is realized via Send()/Recv() functions from EasyNet.

Five bitstreams are generated:

  1. node_1_mem_first: as the start node, do memory reading and computation in layer 1
  2. node_1_compute: do computation only in layer 1
  3. node_1_mem: do memory reading and computation in layer 1
  4. node_2: do computation in layer 2
  5. node_3: do computation in layer 3 and output layer

The basic communication pattern is in a ring method like: node_1_mem_first -> node_1_compute -> node_1_mem -> node_1_compute ... -> node_1_mem -> node_1_compute -> node_2 -> node_3.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published