Repo for my PyCon Colombia 2024 talk called Embeddings is all you need: Artificial Intelligence for text, audio and images. See all the info about the event, clic here
Currently working on it! 🎯⚒️⚙️💡
At their core, embeddings are a way to represent words, phrases, or even entire documents as vectors of real numbers in a continuous vector space. This representation captures the semantic meaning of the text, allowing similar words to have similar representations. Popular techniques to create embeddings include Word2Vec, GloVe (Global Vectors for Word Representation), and more recently, contextual embeddings like those from deep neural networks.
In this repo and talk we are going to explore how to use them to develop applications based on AI with the following 3 uses cases.
This implementation is based on RAG (Retrieval Augmented Generation) architechture which uses LLMs, Embeddings, Vector Databases and Prompt Engineering to build a Chatbot for the PyCon Colombia 2024 as next:
This implementation is a version on Reverse Image Search. Using ViT (Vision Transformer) models, Embeddings and Vector Databases to can search product into a web fashion store based on a image of a product as input and retrieve the most similar products.
This implementation is based on a Music recommender. Using PANNs (Pretrained Audio Neural Networks) models, Embeddings and Vector Databases to can search and recommend songs similar to your favorites songs. You enter a YouTube URL of a song and it fetch recommendations.