Skip to content

Yang Gao's implementation of tf-idf text indexing scheme, predict doc similarity by cosine similarity.

Notifications You must be signed in to change notification settings

ChangQF/tfidf_cosine_cpp

 
 

Repository files navigation

tfidf_cosine_cpp

Yang Gao's implementation of tf-idf text indexing scheme, predict doc similarity by cosine similarity. Refer to: http://en.wikipedia.org/wiki/Tf–idf; yet I use normalized tf instead of raw tf.

It can serve as a baseline for more complicated text indexing and retrieval models, such as topic model.

usage

  • see "run_examples.sh" for example usage.

dependencies

  1. external libraries, such as Eigen and tclap, are included; therefore the code is ready to run

compiling

  1. for initial build, type "make";
  2. if you modify code, type "make rebuild"

questions

for questions, comments or to report bugs, contact Yang Gao(USC/ISI) at [email protected]

About

Yang Gao's implementation of tf-idf text indexing scheme, predict doc similarity by cosine similarity.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 48.0%
  • C 30.8%
  • Fortran 20.6%
  • Shell 0.2%
  • Python 0.1%
  • JavaScript 0.1%
  • Other 0.2%