Commit graph

1308 commits

Author SHA1 Message Date
Mikkel Denker
a4b21e0778 harmonic centrality 2022-05-23 16:46:58 +02:00
Mikkel Denker
3a4a8a734a distance calculations 2022-05-23 16:26:10 +02:00
Mikkel Denker
910b64c447 simple memory graph store 2022-05-23 10:54:44 +02:00
Mikkel Denker
3b4d1aa929 style+link tags are ignored when extracting text 2022-05-22 12:54:41 +02:00
Mikkel Denker
d41e698319 parse webpage 2022-05-20 17:35:12 +02:00
Mikkel Denker
9ca9a9c5e2 decode encodings other than utf8 and parse metadata 2022-05-20 12:37:50 +02:00
Mikkel Denker
d844a8594c read warc files 2022-05-20 11:56:33 +02:00
Mikkel Denker
450922b906 initial commit 2022-05-19 12:29:54 +02:00