
Sparse data support with example #2364

Open
wants to merge 1 commit into master
Conversation

alemagnani

This PR is a replacement for #937, rebased to master and with an added example.

This adds basic support for sparse data in CSR format. The main pieces are a SparseBlob to store the sparse data, an extension of InnerProduct that handles both dense and sparse input depending on what is presented at the bottom, and a new data layer that reads sparse data from DBs.
Some more details:

  • subclass of Blob that handles matrices in CSR format (SparseBlob)
  • subclass of InnerProductLayer (SparseInnerProductLayer) to deal with a sparse blob as input (it handles dense at the same time)
  • a new data layer (SparseDataLayer) that reads sparse vectors from a DB and creates SparseBlobs
  • support for cuSPARSE
  • new sparse math operations on both GPU and CPU
  • a small change to SyncedMem to make it possible to own or not own the GPU data
  • a new protobuf message (SparseDatum) to store sparse vectors
  • changed the way Net creates Blobs to support sparse blobs. Basically the bottom layer gets to decide what Blob to create on top. This way the SparseDataLayer produces SparseBlobs (see the new method in layer_factory.cpp). There could be other ways to achieve the same results.
  • extensive tests for the new math functions and the new layers
  • 2 different implementations of the GPU sparse kernels (one is commented out). This is because I found that cuSPARSE has very poor performance, at least in the way I used it: it does not support sparse multiplication for all transpose cases.
  • a simple example of bag-of-words classification with logistic regression and a network with one hidden layer, similar in spirit to the "hdf5_classification" example
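For readers unfamiliar with the CSR layout that a SparseBlob-style container stores, here is a minimal pure-Python sketch. It is illustrative only; the function names are not taken from the PR.

```python
# Minimal sketch of the CSR (compressed sparse row) layout;
# illustrative only, not the PR's actual SparseBlob API.

def dense_to_csr(rows):
    """Convert a dense matrix (list of lists) to CSR arrays."""
    values, indices, ptr = [], [], [0]
    for row in rows:
        for col, v in enumerate(row):
            if v != 0:
                values.append(v)   # non-zero value
                indices.append(col)  # its column index
        ptr.append(len(values))  # row i occupies values[ptr[i]:ptr[i+1]]
    return values, indices, ptr

def csr_matvec(values, indices, ptr, x):
    """Multiply a CSR matrix by a dense vector x, row by row."""
    y = []
    for i in range(len(ptr) - 1):
        acc = 0.0
        for k in range(ptr[i], ptr[i + 1]):
            acc += values[k] * x[indices[k]]
        y.append(acc)
    return y

# Example: a sparse 3x4 batch of bag-of-words-style features
dense = [[0, 2, 0, 0],
         [1, 0, 0, 3],
         [0, 0, 0, 0]]
values, indices, ptr = dense_to_csr(dense)
# values = [2, 1, 3], indices = [1, 0, 3], ptr = [0, 1, 3, 3]
print(csr_matvec(values, indices, ptr, [1.0, 1.0, 1.0, 1.0]))  # [2.0, 4.0, 0.0]
```

Note how an all-zero row costs no storage at all: its slice of `values` is empty and only `ptr` records it.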

@alemagnani alemagnani mentioned this pull request Apr 25, 2015
@sirotenko

Thanks for a great PR. Sparsity is a very promising topic.
In my experiments some time ago I found that not all sparse libraries support sparse-dense operations. Also, on an MNIST model I found that, when using cuSPARSE with dense weights and sparse feature maps, the amount of sparsity needed to match the performance of the dense case is about 5%.
I'm wondering whether you have tried comparing dense and sparse nets in terms of performance?

@alemagnani
Author

Unfortunately I don't have a good comparison available. I could also commit a data layer that converts the sparse data into a dense blob before feeding it to the network. This could be used to compare the two approaches directly:

  1. sparse data times dense weights
  2. sparse data -> dense batch times dense weights
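The two approaches above compute the same product and differ only in speed and memory. A minimal pure-Python sketch of the comparison (illustrative names only, not code from the PR):

```python
# Sketch of the two paths being compared; illustrative only.
# Path 1 multiplies the CSR data directly; path 2 densifies the
# batch first and then does an ordinary dense product.

def csr_matvec(values, indices, ptr, x):
    """Path 1: sparse data times dense weights, row by row."""
    return [sum(values[k] * x[indices[k]] for k in range(ptr[i], ptr[i + 1]))
            for i in range(len(ptr) - 1)]

def csr_to_dense(values, indices, ptr, ncols):
    """Path 2, step 1: expand the CSR batch into a dense matrix."""
    rows = []
    for i in range(len(ptr) - 1):
        row = [0.0] * ncols
        for k in range(ptr[i], ptr[i + 1]):
            row[indices[k]] = values[k]
        rows.append(row)
    return rows

def dense_matvec(rows, x):
    """Path 2, step 2: dense batch times dense weights."""
    return [sum(r[j] * x[j] for j in range(len(x))) for r in rows]

# Same CSR input through both paths gives the same result:
values, indices, ptr = [2.0, 1.0, 3.0], [1, 0, 3], [0, 1, 3, 3]
w = [0.5, 0.5, 0.5, 0.5]
assert csr_matvec(values, indices, ptr, w) == \
       dense_matvec(csr_to_dense(values, indices, ptr, 4), w)
```

Path 2 pays for the densification and for multiplying through all the zeros, which is exactly the overhead a direct sparse-times-dense kernel avoids.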

@buaaliyi
Contributor

buaaliyi commented Jul 1, 2015

Great PR.

I've tried using this patch to train on my sparse data. However, I found there were many memory re-allocations, because the number of non-zeros (nnz) differs between mini-batches, which requires reshaping prefetch_data. All those memory allocs/frees may cost extra time on top of the fp/bp computation. Is there any plan to address this? Thanks.
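One common way to avoid per-batch alloc/free when the logical size changes every mini-batch is capacity-based growth: keep the buffer sized to the largest nnz seen so far (growing geometrically) and only reallocate when a batch exceeds the current capacity. A minimal sketch of the idea, not code from the PR:

```python
# Sketch of capacity-based reallocation for a prefetch buffer whose
# logical size (nnz) changes per mini-batch; illustrative only.

class GrowOnlyBuffer:
    def __init__(self):
        self.data = []        # backing storage
        self.capacity = 0     # allocated size
        self.nnz = 0          # logical size of the current batch
        self.allocations = 0  # count of real (re)allocations

    def reshape(self, nnz):
        """Resize logically; reallocate only when capacity is exceeded."""
        if nnz > self.capacity:
            # Grow geometrically so repeated small increases amortize.
            self.capacity = max(nnz, 2 * self.capacity)
            self.data = [0.0] * self.capacity
            self.allocations += 1
        self.nnz = nnz  # storage beyond nnz is slack, reused next batch

buf = GrowOnlyBuffer()
for batch_nnz in [100, 80, 120, 90, 110, 240, 200]:
    buf.reshape(batch_nnz)
print(buf.allocations)  # 3 reallocations for 7 batches
```

The trade-off is some slack memory between batches, in exchange for an allocation count that grows logarithmically rather than linearly with the number of batches.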

@bhack
Contributor

bhack commented Jul 11, 2015

What do you think of this project https://github.com/btgraham/SparseConvNet/?

@yangjunpro

Guys, may I know the progress or plan for merging this sparse support into Caffe master?
For our current tasks we really need sparse data support; if there is no short-term plan to merge this into master, we will build our own, or base it on alemagnani's nice commit.

@beniz

beniz commented Dec 7, 2015

@yangjunpro I'm interested and can help on / test a merge.

@beniz

beniz commented Apr 29, 2016

This PR is very useful. We need it to scale to sparse NLP inputs, so I have reworked it to be up to date with our (customized) Caffe master, here: https://github.com/beniz/caffe/tree/master_dd_integ_sparse

It was a bit of a struggle to refactor the code (1), but everything works fine on both CPU and GPU (e.g. @alemagnani's initial example based on 20 newsgroups), as do the unit tests.

I understand that the maintainers either could not find the time to integrate this great piece by @alemagnani or judged that it should be implemented differently, but for us this is a great addition, and we will therefore maintain it against the Caffe master branch over time.

I could provide a PR or a patch against current master if there's interest. Our Caffe version differs slightly from the official one in that we turn LOG(FATAL) into exceptions, plus some other non-invasive modifications, so the few commits from our custom tree would not apply as-is to the current Caffe master branch.

FTR, at this point we cannot quantify anything about the issue reported by @buaaliyi, but we will try to provide informative measurements.

Thanks @alemagnani for the great initial work!

(1) This includes templating DataReader to work with SparseBatch and SparseDatum; the original code is now broken down into BasePrefetchingSparseDataLayer and SparseDataLayer.
