Last Updated: February 25, 2016
·
671
· anthonynystrom

Pattern is RAD

Pattern

Pattern is a web mining module for the Python programming language.
It bundles tools for data retrieval (Google + Twitter + Wikipedia API, web spider, HTML DOM parser), text analysis (rule-based shallow parser, WordNet interface, syntactical + semantical n-gram search algorithm, tf-idf + cosine similarity + LSA metrics), clustering and classification (k-means, KNN, SVM), and data visualization (graph networks).

http://www.clips.ua.ac.be/pages/pattern

https://github.com/clips/pattern