Reverse Dependencies of sox
The following projects have a declared dependency on sox:
- aniemore — Aniemore (Artem Nikita Ilya EMOtion REcognition) is a library for emotion recognition in voice and text for russian language.
- armory-testbed — Adversarial Robustness Test Bed
- asr-library — Automatic Speech Recognition inference for wav2vec2 models
- AudAugio — Augments audio for machine learning
- audiomate — Audiomate is a library for working with audio datasets.
- basic-pitch — Basic Pitch, a lightweight yet powerful audio-to-MIDI converter with pitch bend detection.
- bol-library — Speech to Text Library for Indic Languages
- brain-ai — Build your own intelligent personal assistant.
- brevitas — Quantization-aware training in PyTorch
- charmory — Adversarial Robustness Evaluation Library
- concrete-ml-extensions-brevitas — Quantization-aware training in PyTorch
- coqui-stt-training — Training code for Coqui STT
- dcase-models — Python library for rapid prototyping of environmental sound analysis systems
- espydio — A command line utility built using python to automate audio file conversions, thereby assisting audio playing on ESP32
- iarahealth-stt-training — Training code for Coqui STT
- kalliope — Kalliope is a modular always-on voice controlled personal assistant designed for home automation.
- lecture-transcriber — A DeepSpeech-based transcriber using DeepSegment to separate sentences in a long audio recording.
- magenta — Use machine learning to create art and music
- magenta-gpu — Use machine learning to create art and music
- manim_onvoice — Manim Onvoice Termux for Manim
- manim-recorder — Manim plugin for recorder
- manim-voiceover — Manim plugin for all things voiceover
- modelscope — ModelScope: bring the notion of Model-as-a-Service to life.
- mutwo.mbrola — mbrola extension for event based framework for generative art
- nemo-asr — Collection of Neural Modules for Speech Recognition
- nemo-toolkit — NeMo - a toolkit for Conversational AI
- nussl — A flexible sound source separation library.
- paddle-parakeet — Speech synthesis tools and models based on Paddlepaddle
- paddleaudio — Speech audio tools based on Paddlepaddle
- paddlespeech — Speech tools and models based on Paddlepaddle
- python-vcon — vCon conversational data container manipulation package
- radtts — RADTTS library
- riffusion — Stable diffusion for real-time music generation.
- rvc-infer — Python wrapper for inference with rvc
- scaper — A library for soundscape synthesis and augmentation
- Shyna-speaks — Shyna Speak package, Phase one
- ShynaBack — Shyna Backend functionality Package
- ShynaProcess — Shyna Backend Functionality Package For phase one processing
- sonar-space — SONAR provides a set of speech and text encoders for multilingual, multimodal semantic embedding.
- sonusai — Framework for building deep neural network models for sound, speech, and voice AI
- soundbook — easily download and merge split online audiobooks
- soxbindings — Python bindings for sox.
- speteval — A useful module
- tictacsync — command for syncing audio video recordings
- torch-conduit — Lightweight framework for dataloading with PyTorch and channeling the power of PyTorch Lightning
- torchdataset — This is a package to handle various kinds of data in a unified way with Pytorch.
- tts-middleware — no summary
- tubedreams — create dream-sequences from your video browsing history
- ultimate-rvc — Ultimate RVC
- voicegen — no summary
1