Reverse Dependencies of pydub
The following projects have a declared dependency on pydub:
- easymix — Simple live and track python audio mixer
- easyplayer — Easyplayer is a python library that encapsulates the complex API of pygame2 to help users build games faster.
- ebook2audiobook — Convert eBooks to Audiobooks using a Text-to-Speech model with optional Gradio interface.
- efb-telegram-master — Telegram Master Channel for EH Forwarder Bot, based on Telegram Bot API.
- efb-voice-recog-middleware — WeChat Middleware for EH Forwarder Bot to convert voice to text, based on Baidu and Microsoft API.
- efb-wechat-comwechat-slave — EFB Slave for WeChat on ComWeChat
- elan-scissors — Cut ELAN audio into snippets.
- elixir-client — Elixir client enables remote execution of python code triggered from a Crucible Plugin on the Signals & Sorcery platform.
- emotion_detective — This package provides functions to analyze emotions in video or audio files. It offers a comprehensive set of tools to detect and analyze emotions at a sentence level, producing valuable insights into the emotion-content in videos and audios.
- EmotiVoice — EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
- Endeless — Create playlists with seamless music transitions.
- esa-ai — An Artificial Intelligence Automation Platform. AI Instruction management from various providers, has an adaptive memory, and a versatile plugin system with many commands including web browsing. Supports many AI providers and models and growing support every day.
- eternalblue — A diarization package
- EtmA2T — A small example package
- EtmAudio2Text — A small example package
- etripy — 인공지능 기술을 체험할 수 있는 공공 인공지능 오픈 API Wrapper
- ezlocalai — ezlocalai is an easy to set up local multimodal artificial intelligence server with OpenAI Style Endpoints.
- fajrGPT — A Python application to assist in waking up for Fajr prayer by providing 3 interactive verses/explanations from the Quran + ChatGPT explanations accompanied by a soothing Islamic prayer fade-in and fade-out audio file from YouTube.
- farm-haystack-text2speech — Haystack node to convert text entities (documents, answers, etc...) into audio files.
- fast-tts — no summary
- FastDub — A Python CLI package for voice over subtitles, with the ability to embed in video, audio ducking, and dynamic voice changer for a single track; auto translating; download and upload to YouTube supports
- fat-llama — fat_llama is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes CUDA-accelerated calculations to enhance audio quality by upsampling and adding missing frequencies through FFT (Fast Fourier Transform), resulting in richer and more detailed audio.
- fat-llama-fftw — fat_llama_fftw is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes cpu-accelerated calculations to enhance audio quality by upsampling and adding missing frequencies through FFT (Fast Fourier Transform), resulting in richer and more detailed audio.
- fbgradio — Python library for easily interacting with trained machine learning models
- Feluda — no summary
- ffmpeg-python-utils — Python scripts constructing ffmpeg commands and running them by subprocess.
- ffmpegaudiorecord — Starts recording audio from the specified device using ffmpeg and stops recording after a specified number of seconds of silence
- fftrack — FFTrack is a Python-based music recognition tool that allows users to identify songs from audio input.
- fibs-reporter — Automatically generate a pdf report containing feature importance, baseline modelling, spurious correlation detection, and more, from a single command line input
- filemac — Open source Python CLI toolkit for conversion, manipulation, Analysis of files (All major file operations)
- first-shell — The first shell for every littel nerd
- fixie — Fixie.ai SDK for Python. Enables you to build AI-powered voice applications.
- flac2mp3-cli — A Python CLI tool to convert FLAC to MP3
- flashvideo — flashvideo is a lightweight framework for accelerating large video diffusion models.
- fmdpy — Music Downloader
- forcealign — A Python library for forced alignment of English text to English audio.
- fortepyan — Process MIDI piano with (almost) no pain
- foxlator-lib — Library backend for foxlator
- framewise-secureline — A client library for content security detection
- freedvtnc2 — no summary
- freem-bots — Abstractions over Discord with simple TTS voice support
- freeselcall — no summary
- friday-d33pster — Hey Siri and Ok Google type Voice Assistant implemented in python+rust, Iron Man style!
- ftis — The finding things in stuff package.
- funkwhale-api — no summary
- furchain — FurChain is an innovative toolkit for creating and interacting with digital personas, complete with voice cloning and role-playing capabilities. It offers a suite of tools for real-time voice manipulation, chatbot creation, and text-based RPG adventures, all while being open-source and operable offline.
- g4f-xn — library for easy access to GPT models based on g4f
- GailBot — GailBot API
- GalaxyBalaxyUpload — no summary
- Gao-Anime — AI Head
- geminiSH — Tool based on Google`s Gemini to work on diferents code bases.
- genaibook — Utilities for 'Hands-On Generative AI with Transformers and Diffusion Models' (upcoming)
- GeneralAgent — General Agent: From LLM to Agent
- generalkit — easy tool kit
- GenerIter — A package for Generative Iterative music composition.
- geniusrise-audio — audio bolts for geniusrise
- gianna — Generative Intelligent Artificial Neural Network Assistant
- gideonai — An intelligent bot written in Python
- gm-pymms — python xmms inspired media player/recorder
- godork — Scrape Google search quickly
- goodbyecaptcha — An asynchronized Python library to automate solving ReCAPTCHA v2 by images/audio
- gptspeak — Text-to-speech CLI tool and Python library using OpenAI's TTS API
- gradio — Python library for easily interacting with trained machine learning models
- gradio-frp — Python library for easily interacting with trained machine learning models
- gradio-test-client-pypi — Python library for easily interacting with trained machine learning models
- gradio-test-pypi — Python library for easily interacting with trained machine learning models
- graphite-datasets — tensorflow/datasets is a library of datasets ready to use with TensorFlow.
- gtech-ariel — Google EMEA gTech Ads Data Science Team's solution to automatically translate and dub video ads into multiple languages using AI.
- h2ogpt — no summary
- Harmonify — Harmonify: A project fork of RVC V2
- harmonizer — Harmonizes your audio media: converts, normalizes, enriches and validates.
- hindikosh — Hindi corpus reader
- hlvox — Pieces together voice files to form sentences
- hume — A Python SDK for Hume AI
- icad-tone-detection — A Python library for extracting scanner radio tones from scanner audio.
- IITD-speech-vone — Useful speech recognition and transcription related library for Indian languages.
- import-a — A folder of functions and classes that are easy to import
- incognitoGPT — no summary
- infinite-context — no summary
- insynth — Domain-specific generation of test inputs for robustness testing of ML models
- intentional-terminal — Plugin that makes Intentional able to use a local CLI to communicate
- inter-morse — Interpret wav audio files to Morse code and translate it.
- ipy-agent — An AI assistant designed to be integrated in an IPython shell.
- jakym — Just Another Konsole YouTube-Music
- jarvis-toolkit — Jarvis Toolkit
- jonbot — a friendly machine 🤖❤️✨
- journal-club — Weighted scheme to choose a person to present in a Journal Club
- jrvc — Libraries for RVC inference
- jsr-fun — no summary
- kaldi-helpers — Scripts for preparing language data for use with Kaldi ASR
- kaldigrpc-client — Python client for Kaldi GRPC server
- karaoke-generator — Fully automated creation of _acceptable_ karaoke music videos from any music on YouTube, using open source tools and AI (e.g. Whisper and MDX-Net)
- kazquakersudp — Tools for receiving and interacting with Raspberry Shake UDP data
- key-bpm-renamer — A tool to rename audio files based on their key and BPM
- khnlp — Khmer NLP Toolkits
- kibernikto — Easily run telegram bots connected to AI models.
- klay-beam — Toolkit for massively parallel audio processing via Apache Beam
- knowledgegpt — A package for extracting and querying knowledge using GPT models
- krutrim-cloud — The official Python library for the Krutrim Cloud API
- kur — Descriptive deep learning