Apache Lucene open-source search software
-
Updated
May 29, 2024 - Java
Apache Lucene open-source search software
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
Efficient late-interaction retrieval systems in Julia!
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
Cloud-native vector similarity search and storage with efficient, serverless scale-out
Retrieval and Retrieval-augmented LLMs
A list of multi-vector retrieval resources
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
Elevate user interactions with ChatFAQ: your open-source chatbot solution, offering the full spectrum of ChatGPT capabilities. AI + LLM + CMS
Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Ratchet-ML, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
MTEB: Massive Text Embedding Benchmark
Apache Solr open-source search software
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
Text Embedding, Retrieval, Rerank and RAG
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
A full-text article retrieval pipeline for biomedical literature.
cuVS - a library for vector search and clustering on the GPU
Add a description, image, and links to the information-retrieval topic page so that developers can more easily learn about it.
To associate your repository with the information-retrieval topic, visit your repo's landing page and select "manage topics."