PyCon Colombia 2024

Repo for my PyCon Colombia 2024 talk called Embeddings is all you need: Artificial Intelligence for text, audio and images. See all the info about the event, clic here

Embeddings is all you need: Artificial Intelligence for text, audio and images.

Currently working on it! 🎯⚒️⚙️💡

At their core, embeddings are a way to represent words, phrases, or even entire documents as vectors of real numbers in a continuous vector space. This representation captures the semantic meaning of the text, allowing similar words to have similar representations. Popular techniques to create embeddings include Word2Vec, GloVe (Global Vectors for Word Representation), and more recently, contextual embeddings like those from deep neural networks.

In this repo and talk we are going to explore how to use them to develop applications based on AI with the following 3 uses cases.

1. Chat with your own data. The PyCon Colombia 2024 ChatBot 🤖

This implementation is based on RAG (Retrieval Augmented Generation) architechture which uses LLMs, Embeddings, Vector Databases and Prompt Engineering to build a Chatbot for the PyCon Colombia 2024 as next:

2. Web Store that allows to find similar products based on a image 🔎

This implementation is a version on Reverse Image Search. Using ViT (Vision Transformer) models, Embeddings and Vector Databases to can search product into a web fashion store based on a image of a product as input and retrieve the most similar products.

3. Music Recommender Web App: Find songs similar to your favorite songs 🎵

This implementation is based on a Music recommender. Using PANNs (Pretrained Audio Neural Networks) models, Embeddings and Vector Databases to can search and recommend songs similar to your favorites songs. You enter a YouTube URL of a song and it fetch recommendations.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
assets		assets
data/unstructured		data/unstructured
notebooks		notebooks
qdrant_storage		qdrant_storage
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
dog.png		dog.png
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
test.ipynb		test.ipynb

License

jolual2747/pycon-colombia-2024

Folders and files

Latest commit

History

Repository files navigation

PyCon Colombia 2024

Embeddings is all you need: Artificial Intelligence for text, audio and images.

1. Chat with your own data. The PyCon Colombia 2024 ChatBot 🤖

2. Web Store that allows to find similar products based on a image 🔎

3. Music Recommender Web App: Find songs similar to your favorite songs 🎵

About

Topics

Resources

License

Stars

Watchers

Forks

Languages