A Python application that extracts text from PDF files, generates embeddings using Google's Gemini API, stores them in ChromaDB, and performs K-Means clustering to organize documents by similarity.