Wheelodex — pyannote.audio — Reverse Dependencies

Wheelodex » Projects » pyannote.audio » Reverse Dependencies

Reverse Dependencies of pyannote.audio

The following projects have a declared dependency on pyannote.audio:

achatbot — An open source chat bot for voice (and multimodal) assistants
africanwhisper — A framework for fast fine-tuning and API endpoint deployment of Whisper model specifically developed to accelerate Automatic Speech Recognition(ASR) for African Languages.
audio-scribe — A command-line tool for audio transcription with Whisper and Pyannote.
audiotextspeakerchangedetect — A Package to Detect Speaker Change based on Textual Features via LLMs & Rule-Based NLP and Audio Features via Pyannote & Spectral Clustering
audiotranscription — no summary
bmnspeechlib — A fork of speechlib with customizations for our team
clipsai — Clips AI is an open-source Python library that automatically converts long videos into clips
demonstrable-whisperx-service — A standalone service for transcribing audio files using WhisperX
diarize-whisper — Librairie pour la transcription ASR et la diarisation
diart — A python framework to build AI for real-time speech
eternalblue — A diarization package
gogadget — gogadget is a toolkit for producing immersion and priming materials for language learning. It is capable of downloading audio and video files, automatically transcribing subtitles from videos and podcasts, and automatically producing filtered Anki decks with sentence audio / translations / screenshots / definitions.
gryannote — Provide Gradio custom components to make the diarization-based audio annotation process easier
gtech-ariel — Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
insanely-fast-whisper — An insanely fast whisper CLI
knorket-whisper — Speech Recognition plus diarization
medkit-lib — A Python library for a learning health system
mexca — Emotion expression capture from multiple modalities.
miya-speechless — Speechless repo for sales call analysis
neural-sync — A library to standardize the usage of various machine learning models
open-dubbing — AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
openwillis-transcribe — digital health measurement
pafst — Library That Preprocessing Audio For TTS/STT.
pafts — Library That Preprocessing Audio For TTS.
pickpod — Integrated tools to transfer internet audio to text, extract unpopular views, and pick up podcasts for you.
powerset-calibration — Companion package to the 'On the calibration of powerset speaker diarization models' paper published at Interspeech 2024.
psifx — Psychological and Social Interactions Feature Extraction
pyhearingai — Library for transcribing audio conversations with accurate speaker identification
rlmc — Python utils for AI 🚀
s-d-pyannote — A speaker diarization pipeline made with pyannote
scraibe — Transcription tool for audio files based on Whisper and Pyannote
sd-pyannote-v1 — A speaker diarization pipeline made with pyannote
senselab — senselab is a Python package that simplifies building pipelines for speech and voice analysis.
sociaML — sociaML - the Swiss Army knife for audiovisual and textual video feature extraction.
speaker-diarization-pyaudio — A speaker diarization pipeline made with pyannote
speakerbox — Speaker Annotation for Transcripts using Audio Classification
speakerchangedetect — A Package to Detect Speaker Change based on Textual Features via LLMs & Rule-Based NLP and Audio Features via Pyannote & Spectral Clustering
speechbox — Speechbox
speechlib — speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names. This library also contain audio preprocessor functions.
transcribify — A tool for transcribing audio files with optional speaker diarization
TurnVoice — Replaces and translates voices in youtube videos
vanpy — VANPY - Voice Analysis framework in Python
verbatim — high quality multi-lingual speech to text
VocalForge — Your one-stop solution for voice dataset creation
wenbi — A simple tool to make the video, audio, subtitle and video-url (especially youtube) content into a written markdown files with the ability to rewritten the oral expression into written ones, or translating the content into a target language by using LLM.
whisper-pyannote-fusion — Fuse whisper and pyannote results
whisper-run — Whisper with speaker diarization
whisperer-ml — Go from raw audio to a text-audio dataset with OpenAI's Whisper
whisperplus — WhisperPlus: A Python library for WhisperPlus API.
whisperx — Time-Accurate Automatic Speech Recognition using Whisper.
whisperx-numpy2-compatibility — A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no longer be required once pull requests on the main package are accepted.
whisply — Transcribe, translate, annotate and subtitle audio and video files with OpenAI's Whisper ... fast!