Reverse Dependencies of sentence-transformers
The following projects have a declared dependency on sentence-transformers:
- testzeus-hercules — Hercules: The World's First Open-Source AI Agent for End-to-End Testing
- tetra-textual-trust-analyzer — TETRA: TExtual TRust Analyzer
- text-analyzer-library — A library for processing and analyzing text with embeddings and visualizations
- text-embeddings-10d — Text embeddings package with multiple model support
- text-eval-benchmark — A Python package to compute text similarity using Sentence Transformers
- text-explainability — Generic explainability architecture for text machine learning models
- text-tagging-model — Here we collected some online and offline models for text tagging.
- text-to-action — A system that translates natural language queries into programmatic actions
- text2mapviewer — A python package to map your own csv files data using Atlas from NOMIC
- text2music — A package for text-to-music generation
- text2topicloss — Text2topic loss for bi-encoder models
- textattack — A library for generating text adversarial examples
- textdiversity — A family of textual diversity metrics
- texteller — A meta-package for installing dependencies
- textembed — TextEmbed provides a robust and scalable REST API for generating vector embeddings from text. Built for performance and flexibility, it supports various sentence-transformer models, allowing users to easily integrate state-of-the-art NLP techniques into their applications. Whether you need embeddings for search, recommendation, or other NLP tasks, TextEmbed delivers with high efficiency.
- textmatchertoolkit — In summary, this package provides a set of utility functions and matching algorithms that can be used to preprocess, analyze, and match text data in various applications such as natural language processing, information retrieval, and text similarity analysis.
- textoir — TEXTOIR is the first high-quality Text Open Intent Recognition platform.
- TextRegress — A package for performing advanced regression on text data using unified deep learning framework.
- texture-viz — Process and profile text datasets interactively
- textweaver — A FastAPI-based web server for working with LLMs, embedding models, and Pinecone Vector DB.
- thepipe-api — Document extraction, powered by multimodal LLMs.
- TherapeuticNLP — A multi-dimensional Natural Language Processing (NLP) framework for analyzing and assessing the quality of therapeutic conversations
- thesaurus-lib — Implemented thesaurus library using SOM
- thext — THExt - Transformer-based Highlights Extraction
- tibo — CLI tool for codebase indexing and natural language retrieval.
- tinyhnsw — no summary
- tknlp — no summary
- tldrai — CLI tool to give quick step-by-step answers to tech questions. Powered by the LLMs and StackOverflow.
- tokenlearn — Pre-train Static Embedders
- tokenwise — Tokenizer evaluation library
- ToolAgents — ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs.
- toolva — SosiAl Media Bigdata Analysis service by pcn
- top2vec — Top2Vec learns jointly embedded topic, document and word vectors.
- topic-autolabel — Automatic topic labeling using LLMs
- topic-benchmark — CLI suite for benchmarking topic models
- topicgpt — A package for integrating LLMs like GPT-3.5 and GPT-4 into topic modelling
- topicgpt_python — Official implementation of TopicGPT: A Prompt-based Topic Modeling Framework (NAACL'24)
- topmost — Topmost: A Topic Modeling System Toolkit
- toponymy — A library for using large language models to name topics
- torchness — PyTorch tools
- TPE-Bot — A TPEdu All-in-one Chatbot Package
- TR-BIAS — Python package to idenitfy bias from a corpus
- transformertopic — Topic modeling using sentence_transformer
- transfusion — Transformers 🤝 diffusion
- translation-quality-estimator — To estimate the quality of translation
- TrendFlow — A tool for literature research and analysis
- triple-encoders — Distributed Sentence Transformer Representations with Triple Encoders
- trust_eval — Metric to measure RAG responses with inline citations
- trustworthyai-text — SDK API to assess text Machine Learning models.
- TruthTorchLM — TruthTorchLM is an open-source library designed to assess truthfulness in language models' outputs. The library integrates state-of-the-art methods, offers comprehensive benchmarking tools across various tasks, and enables seamless integration with popular frameworks like Huggingface and LiteLLM.
- turbo-alignment — turbo-alignment repository
- turftopic — Topic modeling with contextual representations from sentence transformers.
- tweets-to-topic-network — start from a set of tweets and create a multilayer network where each layer is a topic
- twembeddings — event detection in tweets
- txagent — TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
- txtai — All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
- uglypy — A Python package for aggregating and processing RSS feeds with LLM-enhanced content rewriting.
- uhsr — Unified Hyperbolic Spectral Retrieval (UHSR) - a novel text retrieval algorithm combining lexical and semantic search.
- unbound-gateway — Python client library for the Unbound API
- unifai — Unify AI clients into a single interface with enhanced Tool Calling support.
- uniteai — AI, Inside your Editor.
- unitxt — Load any mixture of text to text data in one line of code
- unstructured-ingest — A library that prepares raw documents for downstream ML tasks.
- URAG — Build your own chatbot in no time!
- useb — Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper.
- vatrix — Log Processor & SBERT Training Tool
- vdf-io — This library uses a universal format for vector datasets to easily export and import data from all vector databases.
- vec2text — convert embedding vectors back to text
- vecspace — VecSpace.
- vecstream — A lightweight, efficient vector database with similarity search capabilities
- vector-cache — A streamlined Python library that enhances LLM query performance through semantic caching, making responses faster and more cost-effective.
- vector-nest — A package for text similarity and embeddings
- vector-pipelines — Create scalable vector search powered pipelines with ease
- VectorCode — A tool to vectorise repositories for RAG.
- vectordb2 — A lightweight Python package for storing and retrieving text using chunking, embedding, and vector search
- vectorhub — One liner to encode data into vectors with state-of-the-art models using tensorflow, pytorch and other open source libraries. Word2Vec, Image2Vec, BERT, etc
- vectorhub-nightly — One liner to encode data into vectors with state-of-the-art models using tensorflow, pytorch and other open source libraries. Word2Vec, Image2Vec, BERT, etc
- vectorlite — Lightweight vector database.
- VectorMD — no summary
- vectrs — Decentralized & Distributed Vector Database
- vecworks — Framework to procedurally query vector stores
- vembed — Package providing methods to create Vector Embeddings from Strings, calculate similarities between lists of Strings, and Generate Visualizations such as Heatmaps from simple Lists.
- vesslflow — VESSLFlow
- vidore-benchmark — Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
- vital-ai-vitalsigns — VitalSigns knowledge graph bindings
- vital-llm-reasoner — Vital LLM Reasoner
- vnow-aider-chat — Vnow Fork of Aider is AI pair programming in your terminal
- vokab — vokab: named entity linking through hybrid (lexical and semantic) search engine.
- VoPho — An easy to use Multilingual phonemization meta-library
- vptq — VPTQ (Vector Post-Training Quantization) is a novel Post-Training Quantization method.
- vsql — no summary
- wa-analyzer — Code for the Master of Applied Data Science course Data Analysis and Visualization
- waldiez — waldiez
- weak-annotators — Weak annotators for information extraction (NER)
- weave — A toolkit for building composable interactive data driven applications.
- Webiks-Hebrew-RAGbot — A search engine using machine learning models and Elasticsearch for advanced document retrieval.
- webleaf — HTML DOM Tree Leaf Structure Identification Package
- webllama — Llama-powered agents for automatic web browsing
- websocietysimulator — Web Social Simulator for WWW'25 AgentSociety Challenge
- webwright — Webwright: The Ghost in Your Shell 👻💻