A Python crawler for extensions from the Chrome Web Store.
Go to file
Michael Herzberg 39d7bf0330 Deal with missing annotation block in reviews. 2017-06-20 08:03:15 +01:00
ExtensionCrawler Deal with missing annotation block in reviews. 2017-06-20 08:03:15 +01:00
.gitignore Ignore archive directory (locale test data). 2017-02-06 17:54:59 +00:00
LICENSE initial commit 2016-09-08 20:43:35 +02:00
README.md Removed outdated description. 2017-01-28 13:34:50 +00:00
crawler Fixed getopt setup. 2017-06-19 06:03:36 +01:00
create_db Improved logging. 2017-06-19 18:41:29 +01:00
crx-tool Renaming. 2017-03-19 16:34:45 +00:00

README.md

ExtensionCrawler

A collection of utilities for downloading and analyzing browser extension from the Chrome Web store.

  • crawler: A crawler for extensions from the Chrome Web Store.
  • permstats.py: A tool for generating statistical data from the crawled extensions.
  • crx-tool.py: A tool for analyzing and extracting *.crx files (i.e., Chrome extensions). Calling crx-tool.py <extension>.crx will check the integrity of the extension.

All utilities are written in Python 3.x. The following non-standard modules might be required:

  • requests (apt-get install python3-requests)
  • dateutil (apt-get install python3-dateutil)
  • jsmin (apt-get install python3-jsmin)

Team

License

This project is licensed under the GPL 3.0 (or any later version).