TML has moved to http://www.villalon.cl/tml.html and the code to https://github.com/villalon/tml
Features
- Document indexing and selection using Apache's Lucene
- Fast VSM generation with several local and global weights (term - doc matrix)
- Dimensionality reduction using SVD or NMF for LSA or related.
- Meta-data annotators (PennTree grammar parsing).
- Operations: Document distances, topic clustering, keyword extraction, and many more!
License
Apache License V2.0Follow TML - Text Mining Library for LSA & CMM
Other Useful Business Software
Unlock Free Courses and Advance Your Career
Discover Coursera’s wide range of free courses offered by top universities and industry leaders. Whether you’re looking to gain new skills, earn certificates, or explore a new field, Coursera offers flexible learning opportunities to fit your schedule. Start learning for free today and unlock the path to career growth, all at no cost to you!
Rate This Project
Login To Rate This Project
User Reviews
-
It seems to be good, but there are some errors that dont let the program load correctly the library ( Abstract Annotator constructor receives parameters but PennTreeAnnotator doesnt receive)
-
very good library for doing text mining
-
great