slackbuilds/python/pattern/README

2016-11-07 06:17:12 +08:00
pattern (a web mining module for Python)
Pattern is a web mining module for the Python programming language.
It bundles tools for data retrieval (Google + Twitter + Wikipedia API,
web spider, HTML DOM parser), text analysis (rule-based shallow parser,
WordNet interface, syntactical + semantical n-gram search algorithm,
tf-idf + cosine similarity + LSA metrics), clustering and classification
(k-means, k-NN, SVM), and data visualization (graph networks).
The module is bundled with 30+ examples and 350+ unit tests.
Pattern is written for Python 2.5+ (no support for Python 3 yet).
The source code is released under the BSD license and is available
from http://www.clips.ua.ac.be/pages/pattern.
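
To give a feel for the "tf-idf + cosine similarity" metrics mentioned
above, here is a minimal sketch in plain Python. It does not use
Pattern's own API: the function names tf_idf and cosine_similarity, the
whitespace tokenization, and the sample documents are all assumptions
made for illustration, and the code assumes Python 3 division semantics.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Hypothetical helper for illustration; not part of Pattern.
    """
    n = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        # Weight = relative term frequency * log inverse document frequency.
        vectors.append({
            term: (count / len(doc)) * math.log(n / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # An all-zero vector is treated as similar to nothing.
    return dot / (norm_a * norm_b)


# Made-up sample documents, tokenized on whitespace.
docs = [
    "the cat sat on the mat".split(),
    "the cat lay on the rug".split(),
    "stock markets fell sharply today".split(),
]
vectors = tf_idf(docs)
```

With these sample documents, the first two share terms and so score
higher against each other than either does against the third, which
shares no terms with them.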