mwmbl/mwmbl
2022-12-24 19:59:56 +00:00
..
crawler Store the best items, not the worst ones 2022-07-31 22:55:15 +01:00
indexer Make it easier to rum mwmbl locally 2022-12-07 20:01:31 +00:00
resources Add new LTR model 2022-08-09 22:47:59 +01:00
tinysearchengine Make it easier to rum mwmbl locally 2022-12-07 20:01:31 +00:00
__init__.py renamed package to mwmbl 2021-12-28 12:35:46 +01:00
background.py Split out URL updating from indexing 2022-08-26 22:20:35 +01:00
database.py Fix issue #60 2022-07-10 11:10:03 +02:00
hn_top_domains_filtered.py Don't include web.archive.org as a curated domain 2022-07-04 15:44:28 +01:00
main.py Make it easier to rum mwmbl locally 2022-12-07 20:01:31 +00:00
retry.py Make more robust 2022-06-21 08:44:46 +01:00
settings.py Exclude a domain 2022-12-24 19:59:56 +00:00
tokenizer.py Use terms and bigrams from the beginning of the string only 2022-08-26 17:20:11 +01:00
url_queue.py Use an in-memory queue 2022-07-31 00:43:58 +01:00
utils.py Use an in-memory queue 2022-07-31 00:43:58 +01:00