search
Semantic file search using ImageBind and sqlite-vec
Installation
pip install -r requirements.txt
If you don't need PyTorch with GPU support, first run pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
. During runtime, if you get the error ModuleNotFoundError: No module named 'torchvision.transforms.functional_tensor'
, change functional_tensor
to functional
.
Usage
This program uses a client-server architecture to watch directories with inotify and keep the model loaded in memory. It takes around 15 seconds to load the model so it wouldn't be great if every query had to wait on that.
Run python server.py DIRS_TO_INDEX
to start the server. The server only indexes images, audio, and videos since text files tend to irrelevantly pollute the search results.
Then run python client.py SEARCH_TEXT NUM_RESULTS
to get a list of the most similar files. You can pass this list to an image viewer such as Gwenview to view image results. Note that Gwenview doesn't preserve the order of the images. Alternatively, add a third parameter to python client.py
and it will symlink res0
, res1
, and so on to the files on the list. This can be used in conjunction with the Dolphin file manager's integrated terminal to get thumbnails of the search results.
Check out TIDY for a similar awesome program that runs on Android.