Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Hybrid Vector and Traditional Search #416

Open
KSemenenko opened this issue Apr 19, 2024 · 0 comments
Open

[Feature Request] Hybrid Vector and Traditional Search #416

KSemenenko opened this issue Apr 19, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@KSemenenko
Copy link
Contributor

Context / Scenario

I want a new feature that lets us use date filters for searching documents. For example, if I search for "documents before 2021 with information on LLM," the system should first filter out documents from before 2021, then search these documents for 'LLM'. This mix of vector and regular search would help us find exactly what we need much faster.

The problem

Right now, our system only lets us filter documents by tags. We can't search documents by date for example, which makes it hard to find older documents quickly.

Proposed solution

  1. Document Model Update:

    • Modify the document model to include various data types like text and date.
    • Example: { "documentId": "doc123", "content": "Here's the content", "publishDate": "2020-12-01" }
  2. Data Indexing:

    • Index documents by both content and metadata. For vector data, process text through an embedding model.
  3. Query Processing:

    • Create a parser to extract filters like dates from user queries and separate them from vector search terms.
    • Example query: "Find documents before 2021 about renewable energy."
  4. Search Execution:

    • First, apply traditional filters (e.g., date). Then, within those results, perform a vector-based search.
    • Use tools like Azure AI Search or Elasticsearch to handle both aspects.
  5. Result Handling:

    • Combine and display results, ensuring they meet both content relevance and specific property filters.

Importance

would be great to have

@KSemenenko KSemenenko added the enhancement New feature or request label Apr 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant