Commit graph

5 commits

Author SHA1 Message Date
Daoud Clarke
2d554b14e7 Save results to gzip file 2021-12-07 22:10:16 +00:00
Daoud Clarke
2562a5257a Extract locally 2021-12-05 22:25:37 +00:00
Daoud Clarke
c151fe3777 Extract archive info 2021-12-05 21:42:23 +00:00
Daoud Clarke
14817d7657 Optimise imports 2021-12-05 20:38:05 +00:00
Daoud Clarke
312f32bf61 Add common crawl extract script and dependency management with poetry 2021-12-05 20:31:49 +00:00