JulieA
Published © GPL3+

LLaVA Multimodel Image Search

Using the power of LLaVA (Large Language and Vision Assistant) multimodel model to allow you to organise and find your images with ease.

IntermediateFull instructions provided10 hours430
LLaVA Multimodel Image Search

Things used in this project

Hardware components

AMD Radeon Pro W7900 GPU
AMD Radeon Pro W7900 GPU
×1
AMD Ryzen 5 7600X
×1
ASUS Prime B650-PLUS Motherboard
×1
DDR5 32GB 5200mhz Kingston Fury Memory
×1

Software apps and online services

AMD ROCm™ Software
AMD ROCm™ Software
PyTorch-ROCm6.1
Ollama
Python3
SUN-mini Dataset
Replace this dataset with your own images.
EXIFTool
Docker
PhotoPrism

Story

Read more

Schematics

High Level Diagram

High Level Process Flow Diagram of how the Enhanced Multimodal AI Image Search works

Code

AI Photo Finder Github

Code repo for this project

Credits

JulieA
1 project • 0 followers
Contact

Comments

Please log in or sign up to comment.