Reverse Dependencies of pyannote.audio
The following projects have a declared dependency on pyannote.audio:
- achatbot — An open source chat bot for voice (and multimodal) assistants
- africanwhisper — A framework for fast fine-tuning and API endpoint deployment of the Whisper model, specifically developed to accelerate Automatic Speech Recognition (ASR) for African languages.
- audiotextspeakerchangedetect — A Package to Detect Speaker Change based on Textual Features via LLMs & Rule-Based NLP and Audio Features via Pyannote & Spectral Clustering
- audiotranscription — no summary
- bmnspeechlib — A fork of speechlib with customizations for our team
- clipsai — Clips AI is an open-source Python library that automatically converts long videos into clips
- demonstrable-whisperx-service — A standalone service for transcribing audio files using WhisperX
- diart — A python framework to build AI for real-time speech
- eternalblue — A diarization package
- gogadget — gogadget is a toolkit for producing immersion and priming materials for language learning. It is capable of downloading audio and video files, automatically transcribing subtitles from videos and podcasts, and automatically producing filtered Anki decks with sentence audio / translations / screenshots / definitions.
- gryannote — Provide Gradio custom components to make the diarization-based audio annotation process easier
- gtech-ariel — Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
- insanely-fast-whisper — An insanely fast whisper CLI
- ipevo-python-diarization — IPEVO python diarization module
- knorket-whisper — Speech Recognition plus diarization
- medkit-lib — A Python library for a learning health system
- mexca — Emotion expression capture from multiple modalities.
- miya-speechless — Speechless repo for sales call analysis
- neural-sync — A library to standardize the usage of various machine learning models
- open-dubbing — AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
- openwillis-transcribe — digital health measurement
- pafts — Library that preprocesses audio for TTS.
- pickpod — Integrated tools to transfer internet audio to text, extract unpopular views, and pick up podcasts for you.
- powerset-calibration — Companion package to the 'On the calibration of powerset speaker diarization models' paper published at Interspeech 2024.
- psifx — Psychological and Social Interactions Feature Extraction
- rlmc — Python utils for AI 🚀
- s-d-pyannote — A speaker diarization pipeline made with pyannote
- scraibe — Transcription tool for audio files based on Whisper and Pyannote
- sd-pyannote-v1 — A speaker diarization pipeline made with pyannote
- senselab — Senselab is a Python package that simplifies building pipelines for speech and voice analysis.
- sociaML — sociaML - the Swiss Army knife for audiovisual and textual video feature extraction.
- speaker-diarization-pyaudio — A speaker diarization pipeline made with pyannote
- speakerbox — Speaker Annotation for Transcripts using Audio Classification
- speakerchangedetect — A Package to Detect Speaker Change based on Textual Features via LLMs & Rule-Based NLP and Audio Features via Pyannote & Spectral Clustering
- speechbox — Speechbox
- speechlib — speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with actual speaker names. This library also contains audio preprocessor functions.
- transcribify — A tool for transcribing audio files with optional speaker diarization
- TurnVoice — Replaces and translates voices in YouTube videos
- verbatim — high quality multi-lingual speech to text
- VocalForge — Your one-stop solution for voice dataset creation
- whisper-pyannote-fusion — Fuse whisper and pyannote results
- whisper-run — Whisper with speaker diarization
- whisperer-ml — Go from raw audio to a text-audio dataset with OpenAI's Whisper
- whisperplus — WhisperPlus: A Python library for WhisperPlus API.
- whisperx — Time-Accurate Automatic Speech Recognition using Whisper.
- whisperx-numpy2-compatibility — A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no longer be required once pull requests on the main package are accepted.
- whisply — Transcribe, translate, annotate and subtitle audio and video files with OpenAI's Whisper ... fast!