Reverse Dependencies of spacy
The following projects have a declared dependency on spacy:
- ChadBot6 — no summary
- chatbot-creator — Python package for creating chatbots (including DiscordBot).
- chatbotsclient — no summary
- chatintents — ChatIntents automatically clusters and labels short text intent messages.
- chATLAS-Embed — A modular Python package for efficient embedding workflows and PostgreSQL-based vector store management with parent-child relationships.
- chatsapi — The World's Fastest AI Agent Framework. Based on SBERT & SpaCy Transforms.
- chatterbot — ChatterBot is a machine learning, conversational dialog engine
- chatty-goose — A conversational passage retrieval toolkit
- ChelsiAI — ChelsiAI is a Python library to build your own AI virtual assistant with natural language processing.
- chemrel — A project which focuses on automating and transferring chemical data extraction using span categorization and relation extraction models.
- chima-rufus — no summary
- ChitraGupt — ChitraGupt: A powerful OSINT framework with a plugin-based architecture.
- chronos_ai — no summary
- chunkifyr — Your ultimate toolkit for text chunking.
- ckanext-attribution — A CKAN extension that adds support for complex attribution.
- CLaF — CLaF: Clova Language Framework
- claim-processor — Claim Processor provides automatic checking pipeline for detecting fine-grained hallucinations generated by Large Language Models.
- claimer — Breaks down a textual paragraph into verifiable claims.
- classification-text-email — compiled packages
- classy-classification — Have you ever struggled with needing a spaCy TextCategorizer but didn't have the time to train one from scratch? Classy Classification is the way to go!
- clause-analysis — A library to analyze clauses and classify tenses.
- clause-segmenter — A clause segmenting tool utilising Python's spacy
- cleanmydata — A data cleaning library for text processing
- clementine — 🍊 A sweet little Python package
- clinisift — An NLP tool for parsing, analyzing, and visualizing medical records
- clinitokenizer — Sentence tokenizer for text from clinical notes.
- clinlp — Performant and production-ready NLP pipelines for clinical text written in Dutch
- clip-text-decoder — Generate text captions for images from their CLIP embeddings.
- cliqs — Module providing an implementation of a multilingual crisis social media summarization model.
- cltk — The Classical Language Toolkit
- cltl.mention-detection — Template component for Leolani
- code-context — no summary
- codegraph-agent — no summary
- codeserializerlib — Python library for the code serializer
- cogexquestgen — Question generator from any text
- cognitivefactory-interactive-clustering — Python package used to apply NLP interactive clustering methods.
- coh-summarizer — Summarizer tool
- cohmetrix-br — A Brazilian-Portuguese version of cohmetrix
- cohmetrix-br-lib — Library for extracting linguistic features for Brazilian Portuguese.
- colbert-ir — Efficient and Effective Passage Search via Contextualized Late Interaction over BERT
- collocater — Package for retrieving collocations from text with Spacy
- column-classifier — A column classifier using spaCy for entity recognition.
- cometaNLP — An NLP data and text analysis tool for Italian, Dutch, and English social media comments.
- comics-ocr — ComicsOCR is a Python package created for easily distributing OCR models trained for the golden age of comics.
- comid — A community identification module for Reddit conversations
- CommenlyzerEngine — no summary
- community-dashboard-plots — no summary
- conc-nouns — conc_nouns
- ConCat-LS — Repository for ConCat: a simple and intuitive method for English lexical substitution.
- concepcy — 💫 SpaCy wrapper for ConceptNet 💫
- concise-concepts — This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity confidence scores!
- ConsistencyBench — Tools and Techniques for Consistency Benchmarking
- conspiracies — Discover and examine conspiracies using natural language processing
- constituent-treelib — A lightweight Python library for constructing, processing, and visualizing constituent trees.
- construct-tracker — Track and measure constructs, concepts or categories in text documents.
- context-cite — Attribute (or cite) statements generated by LLMs back to in-context information.
- contexto — Library for text processing and analysis with Python
- contextpro — Python library for concurrent text preprocessing
- contextualSpellCheck — Contextual spell correction using BERT (bidirectional representations)
- contract-reviewer — Using NLP to tag contracts across 12 different fields
- convenient-ai — no summary
- convo-engine — Convo: the AI powered chatbot API 2.0
- convo-latest — Convo: the AI powered chatbot API 2.0
- convo-n2 — Convo: the AI powered chatbot API 2.0
- convo-new-version-abdo — Convo: the AI powered chatbot API 2.0
- convo-nl2 — Convo: the AI powered chatbot API 2.0
- convo-nlu — Convo: the AI powered chatbot API 2.0
- convo-nlu2 — Convo: the AI powered chatbot API 2.0
- convo-tt — Convo: the AI powered chatbot API 2.0
- convo5 — Convo: the AI powered chatbot API
- convo6 — Convo: the AI powered chatbot API
- convo7 — Convo: the AI powered chatbot API
- convosense-utilities — A package to extract the email body out of the email text, by removing signature, in order to get accurate sentiment results.
- convoxxx — Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
- copyrightfpd — Created as a part of the 2023 Google Summer of Code project: Reducing Fossology's False Positive Copyrights, the purpose is to be able to predict whether a given copyright output from the Fossology software is a false positive or not.
- coqui-tts — Deep learning for Text to Speech.
- coreferee — Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further languages
- coreftools — Coreference-Resolution library
- corpus-patterns — Building blocks for spacy Matcher patterns
- corpus-preprocess — Utility functions to preprocess Phil. legalese in weasel-based flows.
- corpus-statistics — A spaCy pipeline component for counting tokens a pipeline has seen.
- corpus2alpino — Converts FoLiA and TEI files to Alpino XML files
- corpusparser — Python library for importing and using corpus data in linguistic research
- cpr-sdk — no summary
- cprex — Chemical Properties Relation Extraction
- crfm-helm — Benchmark for language models
- crosslingual-coreference — A multi-lingual approach to AllenNLP CoReference Resolution, along with a wrapper for spaCy.
- csify — Generate code-switched texts from monolingual texts
- cso-classifier — A light-weight Python app for classifying scientific documents with the topics from the Computer Science Ontology (https://cso.kmi.open.ac.uk/home).
- CTApy — Python package for the Conditional Topic Allocation (CTA)
- ctproc — library for processing clinical trials data from clinicaltrials.gov
- Custom-CVParser — A simple resume parser used for extracting information from resumes
- cv-parser-espanol — no summary
- cv-xtractor — A Python package for extracting information from CVs (resumes).
- cve-analyzer — Simple package that, given a CVE description, tries to extract useful semantics from it using NLP
- cwordtm — CWordTM - Topic Modeling Toolkit
- cyberspacy — spaCy pipeline component for adding cyber meta data to Doc, Token and Span objects.
- cycontext — ConText algorithm using spaCy for clinical NLP
- dacy — A Danish pipeline trained in spaCy that has achieved state-of-the-art performance on dependency parsing, NER and POS-tagging for Danish
- dadmatools-light — DadmaTools is a Persian NLP toolkit