Reverse Dependencies of spacy
The following projects have a declared dependency on spacy:
- danlp — DaNLP: NLP in Danish
- danoliterate — Benchmark of Generative Large Language Models in Danish
- data_generation_tool — A library that provides data generation functionality for AI and data science projects
- data-modori — LMOps Tool for Korean
- data-purifier — A Python library for Automated Exploratory Data Analysis, Automated Data Cleaning and Automated Data Preprocessing For Machine Learning and Natural Language Processing Applications in Python.
- data-science-toolbox — Various code to aid in data science projects for tasks involving data cleaning, ETL, EDA, NLP, viz, feature engineering, feature selection, model validation, etc.
- DataCleanerAI — An interactive, intelligent data-cleaning library with ML-based user adaptation
- datafog — Scan, redact, and manage PII in your documents before they get uploaded to a Retrieval Augmented Generation (RAG) system.
- datalabs — Datalabs
- dataPreprocess — Pre process the textual data for NLP and machine learning applications
- dataPreprocessTest — Pre process the textual data for NLP and machine learning applications
- dataQuest — A package to extract hystorical news sentiments
- DataScience-ToolBox — A set of python modules for machine learning and data mining
- datawords — A library to work with text data
- date-spacy — A spaCy extension for enhanced date and number entity recognition and extraction as structured data.
- datto — Data Tools (Dat To)
- dbgpt-ext — Add your description here
- Dbias — Detect, Recognize and de-bias textual data.
- dbnl — The dbnl Python client SDK
- dbpedia-get — dbPedia Concept Linking and Redirect Analysis
- dcss — Utilities for the book Doing Computational Social Science
- ddhi-encoder — Encoding tools for DDHI
- DeBERTa — Decoding enhanced BERT with Disentangled Attention
- decofre — Neural coreference resolution
- decontext — Pipeline for decontextualization of scientific snippets.
- deduplication — Remove duplicate documents via popular algorithms such as SimHash, SpotSig, Shingling, etc.
- deep-ner — Deep-NER: named entity recognizer based on ELMo or BERT as embeddings and CRF as final classifier
- DeeperAI — A comprehensive AI library featuring deep learning, reinforcement learning, computer vision, and more.
- deephub — no summary
- deidentify — De-identify free-text medical records
- demo-it-analyze — no summary
- demo-py — My Personal Demo Toolbox.
- DenSpa — DenSpa is an open-source package designed for hybrid search, enabling seamless integration into RAG frameworks.
- description2process — Library for constructing a process model given the process description. Deep learning techniques are implmented as much as possible.
- detector-worker — Worker class for incapsulating logic, required for Lionbridge Rnd detectors
- dev-laiser — LAiSER (Leveraging Artificial Intelligence for Skill Extraction & Research) is a tool designed to help learners, educators, and employers extract and share trusted information about skills. It uses a fine-tuned language model to extract raw skill keywords from text, then aligns them with a predefined taxonomy. You can find more technical details in the project’s paper.md and an overview in the README.md.
- dev-work-tracker — This is to track developers bug rate
- dexflex — Spacy plugin working based on dexonline database
- dffml-model-spacy — DFFML model dffml-model-spacy
- dffml-operations-nlp — DFFML operations nlp
- DFProcessor — Preprocessing tools for pandas dataframe
- dframcy — Pandas Dataframe integration for spaCy
- dialog-reflection — A library for dialog systems that attempt to respond to messages as Reflective Listening.
- dialogy — Dialogy is a library for building and managing SLU applications.
- dianna — Deep Insight And Neural Network Analysis
- disaggregators — HuggingFace community-driven open-source library for dataset disaggregation
- Distinct-Keywords — no summary
- ditat-etl — Multiple tools and utilities for ETL pipelines and others.
- django-markov — django-markov is a reusable Django app that enables you to create Markov text models, and store them in the database. Those models can then be used to generate Markov chain sentences.
- django-ner-trainer — Tools for training spaCy Named Entity Recognition models in Django
- dltkai — Python Client for DLTK.
- docanalysis — extract structured information from ethics paragraphs
- docassemble.ALWeaver — no summary
- doccano-client — A simple client for doccano API.
- doccano-transformer — Format transformer tool for doccano
- docdoc — A tool to handle documents
- docrx — search in documents
- Docs2KG — Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models
- docs2tops — Takes a list of documents and returns fully automated & labeled dictionaries where topic names are keys and semantically similar keywords from the documents as values
- doctran — Document transformation framework for vector based retrieval
- document-processing — Pre-process documents for Natural Language Processing using spaCy models
- DocumentAI-std — The main standards for Latis Document AI project
- DocumentInsightsGenerator — A package to generate comprehensive insights from documents using NLP techniques.
- docuscospacy — Support for spaCy models trained on DocuScope and the CLAWS7 tagset
- donew — A Python package for web processing and vision tasks with browser automation capabilities
- dose-instruction-parser — Tool for parsing free text prescription dose instructions into structured output
- dotagent — no summary
- dotagent-dev — no summary
- dotams — no summary
- dotnext — no summary
- dphon — Tools and algorithms for phonology-aware Early Chinese NLP.
- dragon-prep — Preprocessing scripts for the DRAGON benchmark
- drivescanner — Scan your filesystem to look for files that are a potential GDPR risk
- dronnai — Neural Networks Training Center by Dronn.com
- dsbundle — Streamline your data science setup with dsbundle in one effortless install.
- dseqmap4nlp — A small tool to parse and process annotated text corpora
- dsl2 — An implementation of Semantic labeling: A domain-independent approach
- dsp-ml — Demonstrate-Search-Predict
- dspy-ml — Demonstrate-Search-Predict
- dstl — DataSet TransLation (DSTL) provides utilities to translate annotated natural language data from one language to another.
- dsu — no summary
- dunkin-ai-assistant — no summary
- DunkinDonut — no summary
- dutch-text-analytics — Dutch Text Analytics is a versatile toolkit designed to facilitate the exploration, execution, and validation of a diverse range of Natural Language Processing (NLP) tasks specifically tailored for the Dutch language. This repository provides a comprehensive set of tools, including code examples, scripts, and resources, to enhance and streamline your Dutch NLP projects.
- dwdsmor — SFST/SMOR/DWDS-based German morphology
- dxi-nlp — no summary
- dynamicfluency — The base python package for DynamicFluency: Monitor and understand the dynamicity of linguistic aspects in (L2) speech.
- e2eml — An end-to-end solution for automl
- EaglePick — Human-readable programming language for web, mobile, and backend applications
- easy-transformers — Utils for dealing with transformers
- easySum — You can easily summarize the text.
- ector — Extract from a given long text input, eCommerce products and price budget.
- edsnlp — Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailored support for French clinical notes.
- edu-convokit — Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
- ehrnote — Data Science Toolkit for Healthcare Note
- electivegroup — A simple resume parser used for extracting information from resumes
- elemental-tools — A Collection of Utilities. Not even can be described.
- elfen — ELFEN - Efficient Linguistic Feature Extraction for Natural Language Datasets
- elucidoc — Screens legal and other texts for sentences and clauses containing user defined search phrases
- email-chunking — this is a email chunker