search-engine-stract/scripts/run_dev.py
Mikkel Denker fbc01ad865 summarization using mistral and 'chain-of-density' approach.
the summarization becomes much better if we allow the model to first generate a candidate summarization and then improving on it.
doing the improvement step just once seems to significantly improve the summary.
we also now use an llm (mistral 7b) for the summarisations, as we can then use the same model for multiple tasks and serve it using gpus, thus significantly decreasing the latency.
2024-01-19 11:08:17 +01:00

36 lines
878 B
Python
Executable file
Vendored

#!.venv/bin/python3
import argparse
import subprocess
import os
os.environ["LIBTORCH"] = "libtorch"
os.environ["LD_LIBRARY_PATH"] = "libtorch/lib"
os.environ["DYLD_LIBRARY_PATH"] = "libtorch/lib"
parser = argparse.ArgumentParser()
parser.add_argument("--release", action="store_true")
args = parser.parse_args()
if args.release:
os.environ["STRACT_CARGO_ARGS"] = "--release"
processes = []
processes.append(subprocess.Popen(["just", "dev-api"]))
processes.append(subprocess.Popen(["just", "dev-search-server"]))
processes.append(subprocess.Popen(["just", "dev-webgraph"]))
processes.append(subprocess.Popen(["just", "dev-frontend"]))
processes.append(subprocess.Popen(["just", "dev-llm"]))
# kill processes on ctrl-c
import time
while True:
try:
time.sleep(1)
except KeyboardInterrupt:
for p in processes:
p.kill()
break