The training dataset for accelerated machine learning algorithms

Citation Author(s):
Ruixuan Li
Submitted by:
Qi Yang
Last updated:
Thu, 11/08/2018 - 10:34
DOI:
10.21227/fbpn-s032
License:
Categories:

Abstract 

Each document is stored in text format as a 128-dimensional vector, where each dimension is represented as a single-precision floating-point number.

Instructions: 

The training dataset was randomly generated for accelerated machine learning algorithms in which the computing-intensive tasks are offloaded to FPGA accelerators. The data is stored in text format, with one 128-dimensional vector per document and each dimension represented as a single-precision floating-point number, so that the dataset can easily be scaled to hundreds of GB or more. Cosine distance is used to measure vector similarity.
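
As a rough illustration of the layout and the similarity measure described above, the following Python sketch generates random 128-dimensional vectors as text lines and compares two of them. It rests on assumptions that the description does not confirm: one vector per line, space-separated values, values drawn uniformly from [-1, 1], and cosine distance taken as 1 minus cosine similarity. The function names are hypothetical and are not part of the dataset.

```python
import math
import random

DIM = 128  # dimensionality stated in the dataset description


def generate_line(rng, dim=DIM):
    """Return one document vector as a text line.

    Assumption: space-separated values, uniform in [-1, 1]; the real
    file layout and value distribution are not documented.
    """
    return " ".join(f"{rng.uniform(-1.0, 1.0):.6f}" for _ in range(dim))


def parse_line(line):
    """Parse one text line back into a list of floats."""
    return [float(token) for token in line.split()]


def cosine_distance(a, b):
    """Cosine distance, taken here as 1 - cosine similarity (assumed convention)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_a * norm_b)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    doc_a = parse_line(generate_line(rng))
    doc_b = parse_line(generate_line(rng))
    print(f"cosine distance: {cosine_distance(doc_a, doc_b):.4f}")
```

Python floats are double precision, so this sketch only mimics single precision by limiting the printed digits. An FPGA-oriented pipeline would more likely convert the parsed values to 32-bit floats before processing.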

 

Comments

can not get

Submitted by Liu Wei on Mon, 04/26/2021 - 09:28