Commit Graph

1051 Commits

Author SHA1 Message Date
Achim D. Brucker c90bfb5fbb Added support for xz compressed archives. 2018-11-10 20:48:00 +00:00
Achim D. Brucker 2289e8ecc2 Removed reporting of WorkerExceptions. 2018-11-10 16:50:13 +00:00
Achim D. Brucker dd08cdb5a6 Merge branch 'master' of logicalhacking.com:BrowserSecurity/ExtensionCrawler 2018-10-29 19:29:02 +00:00
Achim D. Brucker 17b0ab53e8 Increased requests version (dependency). 2018-10-29 19:28:44 +00:00
Michael Herzberg 81b2bbd21f Added MD5Table and made max simhash dist configurable. 2018-09-12 21:56:30 +01:00
Achim D. Brucker 3b75b839f0 Added SPDX identifier. 2018-09-03 00:30:58 +01:00
Achim D. Brucker 19bccc8be1 Reverted fix for double reporting of downloads. 2018-09-02 20:18:09 +01:00
Michael Herzberg 63fe0806ee Fixed double-logging when not using forkserver. 2018-09-02 11:46:27 +01:00
Achim D. Brucker cb974b6ddf Hack: divide download numbers by two to adapt to log format changes after disabling forkserver. 2018-09-02 11:11:33 +01:00
Achim D. Brucker 9a33bfdd9e Added SPDX identifier. 2018-09-02 10:25:46 +01:00
Achim D. Brucker 4ab2f4d5a3 Added SPDX identifier. 2018-09-02 10:25:41 +01:00
Achim D. Brucker 3d2fb6c054 Added SPDX identifier. 2018-09-02 10:24:52 +01:00
Achim D. Brucker c9ec991cfd Added SPDX identifier. 2018-09-02 10:22:56 +01:00
Achim D. Brucker c7419a2d9f Merge branch 'master' of logicalhacking.com:BrowserSecurity/ExtensionCrawler 2018-09-01 21:52:02 +01:00
Achim D. Brucker 6d8ee12bce Added SPDX identifier. 2018-09-01 21:48:14 +01:00
Achim D. Brucker eb18d051dc Added support for compressed archives. 2018-09-01 21:47:19 +01:00
Michael Herzberg e71323644a Don't use forkserver anymore since it seems to lead to many sem_unlink/file not found exceptions. 2018-09-01 11:34:54 +01:00
Michael Herzberg 7d313790a7 Changed simhashbucket to use MySQL instead of SQLite. 2018-08-31 11:10:32 +01:00
Achim D. Brucker a66620bcc7 Added plot of extension archive size. 2018-08-31 10:49:14 +01:00
Achim D. Brucker 449216907e Increased y-scale to match larger number of extensions (and max download speed). 2018-08-31 10:30:13 +01:00
Achim D. Brucker d7496c8c56 Report WorkerException warnings. 2018-08-19 21:29:07 +01:00
Achim D. Brucker 3cd1b221fd Added reporting of WorkerExceptions. 2018-08-17 22:08:40 +01:00
Michael Herzberg 2d52091292 Merge branch 'master' of logicalhacking.com:BrowserSecurity/ExtensionCrawler 2018-08-17 15:20:09 +01:00
Michael Herzberg 873c249504 Build list for simhash lazily to save memory. 2018-08-17 15:20:00 +01:00
Michael Herzberg e7b7625453 Added database documentation. 2018-08-09 14:20:01 +01:00
Michael Herzberg e492f516ac Restrict sharc jobs to 1 hour. 2018-08-02 16:13:25 +01:00
Michael Herzberg 66db569d5f Only open DB connection when needed. 2018-08-02 12:37:57 +01:00
Michael Herzberg 05c1cbdea5 Give MySQL server up to 1 hour to recover. 2018-08-02 11:47:38 +01:00
Michael Herzberg c6d3056fee Merge branch 'master' of logicalhacking.com:BrowserSecurity/ExtensionCrawler 2018-07-28 19:02:48 +02:00
Michael Herzberg 947ecf50d4 Removed queue length and reduced mysql insert batch size. 2018-07-28 19:02:38 +02:00
Achim D. Brucker 1ee699cc0f Merge branch 'master' of logicalhacking.com:BrowserSecurity/ExtensionCrawler 2018-07-28 10:48:09 +01:00
Achim D. Brucker 5783eb8f27 Optimized image size. 2018-07-28 10:46:24 +01:00
Michael Herzberg 4592cba9b2 Actually return n new ids when discovering. 2018-07-28 10:32:16 +02:00
Michael Herzberg 45a8486f69 Fixed small bug. 2018-07-27 16:39:42 +02:00
Achim D. Brucker 540e45ed4d Added cleanup hook to apt configuration. 2018-07-27 12:52:51 +01:00
Michael Herzberg 85a5645763 Ignore GPU nodes, use less RAM. 2018-07-26 18:38:11 +02:00
Achim D. Brucker d7160e50db Added SPDX identifier. 2018-07-21 12:13:54 +01:00
Achim D. Brucker 86523c840c Added master repository URL. 2018-07-21 12:12:59 +01:00
Michael Herzberg 67b7a46543 Add support for provided extension ids. 2018-07-21 02:15:18 +01:00
Michael Herzberg eb616b0ac3 Fix some encoding issues. 2018-07-21 01:50:59 +01:00
Michael Herzberg 250bdd2c6b Bundle mysql inserts. 2018-07-19 23:26:25 +01:00
Michael Herzberg a1d866d0ff Overwrite last_updated on duplicate. 2018-07-17 14:06:43 +01:00
Michael Herzberg a6173fe23e Don't look for etags in the DB anymore. 2018-07-16 19:19:26 +01:00
Michael Herzberg 4b5cc276ee Added option to use INSERT DELAYED with create-db. 2018-07-16 19:14:24 +01:00
Michael Herzberg 1857ec7b75 Don't restart processes (hopefully mitigates semlock rebuilding error). 2018-07-16 11:21:42 +01:00
Michael Herzberg 8bc4e8fa37 Cache etags in applications. 2018-07-16 01:04:27 +01:00
Michael Herzberg 595f0f8759 Use 16 threads to discover new extensions. 2018-07-15 19:19:26 +01:00
Michael Herzberg c9e66186ef Log duration of tar append. 2018-07-15 19:15:20 +01:00
Michael Herzberg c2eaa10bcb Merge branch 'master' of logicalhacking.com:BrowserSecurity/ExtensionCrawler 2018-07-15 00:57:27 +01:00
Michael Herzberg 3bef0afe7a Group mysql inserts and don't compress them. 2018-07-15 00:08:11 +01:00