Description: Annif is an open source toolkit for automated subject indexing. It integrates several machine learning and AI based algorithms for text classification.
Tool for automated subject indexing and classification
Choose a controlled subject vocabulary and train Annif on already indexed documents – it can then suggest subjects for new documents!
Annif uses a combination of existing natural language processing and machine learning tools including TensorFlow , Omikuji , fastText and Gensim . It is multilingual and can support any subject vocabulary (in SKOS or a simple TSV format). It provides a command-line interface, a simple Web UI and a microservice-style REST API.