Skip to content

A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM and liblinear are currently embedded.

License

Notifications You must be signed in to change notification settings

DigitalPebble/TextClassification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

00bdf83 · Sep 24, 2021

History

71 Commits
Sep 22, 2021
Feb 21, 2012
Jan 15, 2010
Feb 21, 2012
Jul 11, 2012
Jul 26, 2012
Aug 27, 2010
Jan 15, 2010
Feb 9, 2011
Feb 9, 2011
Oct 12, 2020

Repository files navigation

A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM and liblinear [https://github.com/bwaldvogel/liblinear-java] are currently embedded.

Standard weighting schemes are provided (occurrences,frequency,tf-idf) for building the vectors. The API aims at facilitating the use of ML for text applications and offers a convenient field-based representation of documents (e.g. title, content, keywords, etc...) with the option to specify a different weighting scheme per field.

A separate project containing examples of use for the API is available on https://github.com/DigitalPebble/textclassification-examples.

It is also used in the GATE TextClassification plugin on https://github.com/DigitalPebble/TextClassificationPlugin. 

About

A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM and liblinear are currently embedded.

Resources

License

Stars

Watchers

Forks

Packages

No packages published