New feature
- Linguistic Style Matching (LSM) computation
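For readers unfamiliar with LSM, here is a minimal sketch of the commonly published formula: for each function-word category, the score is `1 - |p1 - p2| / (p1 + p2 + 0.0001)`, where `p1` and `p2` are the percentages of words in that category in each text, and the final score is the mean over categories. This is not necessarily how this package computes it. The word lists, `category_rates` and `lsm_score` are hypothetical names chosen for illustration, and the lists are heavily abbreviated.

```python
import re

# Hypothetical, heavily abbreviated function-word lists (assumption:
# real LSM studies rely on much larger dictionaries).
FUNCTION_WORDS = {
    "articles": {"a", "an", "the"},
    "prepositions": {"in", "on", "of", "to", "with", "for", "at", "by"},
    "conjunctions": {"and", "but", "or", "because", "so"},
    "personal_pronouns": {"i", "you", "he", "she", "we", "they"},
    "negations": {"not", "no", "never"},
}


def category_rates(text):
    """Return the percentage of tokens falling in each category."""
    tokens = re.findall(r"[a-z']+", text.lower())
    total = len(tokens)
    rates = {}
    for category, words in FUNCTION_WORDS.items():
        count = sum(1 for token in tokens if token in words)
        rates[category] = 100.0 * count / total if total else 0.0
    return rates


def lsm_score(text_a, text_b, eps=0.0001):
    """Mean per-category LSM between two texts (1.0 = identical usage)."""
    rates_a = category_rates(text_a)
    rates_b = category_rates(text_b)
    scores = [
        1.0 - abs(rates_a[c] - rates_b[c]) / (rates_a[c] + rates_b[c] + eps)
        for c in FUNCTION_WORDS
    ]
    return sum(scores) / len(scores)
```

The small `eps` term only prevents division by zero when neither text uses a category.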
Minor updates
- cleaner retrieval of page links and categories, replacing manual extraction
- dependencies
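As an illustration of what retrieving links and categories through the API (rather than extracting them by hand) can look like, the sketch below builds a MediaWiki `action=query` request with `prop=links|categories`. It only constructs the URL and sends nothing. The function names are hypothetical and the English Wikipedia endpoint is an assumption; the package's own calls may differ.

```python
from urllib.parse import urlencode

# Assumption: English Wikipedia endpoint; other wikis use their own host.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def links_categories_params(title, limit="max"):
    """Query parameters asking for a page's links and categories."""
    return {
        "action": "query",
        "format": "json",
        "titles": title,
        "prop": "links|categories",
        "pllimit": limit,  # maximum number of links per request
        "cllimit": limit,  # maximum number of categories per request
    }


def links_categories_url(title, limit="max"):
    """Full request URL (hypothetical helper, nothing is sent)."""
    return API_ENDPOINT + "?" + urlencode(links_categories_params(title, limit))
```

Large pages return results in batches, so a real client would also need to follow the `continue` values in the response.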
Structure
- documentation is now available on readthedocs
- refactoring of most of the code
- new folder structure
Features
- page editors retrieval
- page parts extraction (links and link titles)
- revision diff retrieval, parsing and information extraction
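To give an idea of the kind of information a revision diff yields, here is a small sketch that compares two revision texts locally with Python's standard `difflib` module and collects the added and removed lines. This is a stand-in technique and not the package's own diff retrieval; `revision_changes` is a hypothetical name.

```python
import difflib


def revision_changes(old_text, new_text):
    """Return the lines added and removed between two revision texts."""
    added, removed = [], []
    diff = difflib.ndiff(old_text.splitlines(), new_text.splitlines())
    for line in diff:
        if line.startswith("+ "):
            added.append(line[2:])    # line only present in the new revision
        elif line.startswith("- "):
            removed.append(line[2:])  # line only present in the old revision
        # lines starting with "  " (unchanged) or "? " (hints) are ignored
    return {"added": added, "removed": removed}
```

For example, `revision_changes("alpha\nbeta", "alpha\ngamma")` reports `beta` as removed and `gamma` as added.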
This version is intended for demonstration and exploration. It is fully functional and already used in some data analysis processes. It includes:
- data retrieval from the Wikipedia API
- data retrieval from third-party sources (page view analytics)
- a local file database
- export features for complex data models (e.g. networks)