» TextGarden

See full content »

Resource type:

Machine Learning and Data Mining Software

Description:

Set of software tools for supervised and unsupervised classification, web mining, visualization, etc.. Written in C++, running on Windows and GNU / Linux via Wine. License undetermined, freely usable for research

Resource link:

» TIMBL-5.1

See full content »

Resource type:

Machine Learning and Data Mining Software

Description:

Decision trees implementation based on KNN classifier. Package includes IB1, IB2, TRIBL, TRIBL2 and IGTree algorithms, and provides several weight metrics.
Python-TiMBL language. License freely available for research and education.

Resource link:

» TnT

See full content »

Resource type:

NLP and IR Software

Description:

Part of speech tagger for tasks of natural language processing. Optimized for speed and training in a wide variety of documents. Free License Agreement for nonprofit research.

Resource link:

» TREC_EVAL

See full content »

Resource type:

NLP and IR Software

Description:

Text Retrieval Conference. Standard tool used by the TREC community to evaluate ad hoc retrieval runs, giving a result file and a standard set of known results.

Resource link:

» Treetagger

See full content »

Resource type:

NLP and IR Software

Description:

Part of Speech Tagger. Executable for Sparc workstations, Linux and Windows PCs and Macs. Free distribution

Resource link:

» WEKA

See full content »

Resource type:

Machine Learning and Data Mining Software

Description:

Java Toolkit for data mining and machine learning. The algorithms can be applied directly to a dataset or called from your own Java code.
Weka contains tools for preprocessing, classification, regression, clustering, association rules, and visualization. GPL License

Resource link:

» Wikipedia XML Corpus

See full content »

Resource type:

Machine Learning and Data Mining Software

Description:

Corpus of Wikipedia articles in several languages ​​oriented to different tasks: classification, text retrieval, multimodal, etc. GNU Document Licence

Resource link:

» XELOPES

See full content »

Resource type:

Machine Learning and Data Mining Software

Description:

Library for data mining. It has versions in C++ and Java. Exists a GPL version

Resource link:

Related links:

» Xerces-J

See full content »

Resource type:

NLP and IR Software

Description:

Parser working with XML in Apache Xerces family. It has a framework to build components for the parser and modular settings easy to program. Available under the Apache Software License.

Resource link:

» Zprise

See full content »

Resource type:

NLP and IR Software

Description:

Z39.50-1995 Prototype Indexing and Search Engines. Deals with documents and questions like lists of words and as response is given a list of documents ordered or ranked by their statistical similarity to the query. Supports for improving the question using relevance feedback. It is in the public domain, freely available

Resource link: