Reverse Dependencies of torchaudio
The following projects have a declared dependency on torchaudio:
- vsensebox — VSenseBox - Python toolbox for visual sensing.
- waloviz — An interactive audio player with a spectrogram built-in, as a Jupyter widget or as HTML.
- wav2clip — Wav2CLIP: Learning Robust Audio Representations From CLIP.
- wavaugment — no summary
- waveaugment — WaveAugment performs data augmentation on audio data.
- wavencoder — WavEncoder - PyTorch backed audio encoder
- wavmark — AI-Based Audio Watermarking Tool
- waylay-ml-adapter-torch — ML_adapter for torch.
- wdoc — A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (url, pdf, epub, youtube (incl playlist), audio, anki, md, docx, pptx, or any combination!)
- wepipe — no summary
- wespeaker-nuaazs — Speaker Embedding
- wespeaker-unofficial — Unofficial wespeaker pypi package
- wespeakerruntime — no summary
- whisper-live — A nearly-live implementation of OpenAI's Whisper.
- whisper-timestamped — Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
- whisperer-ml — Go from raw audio to a text-audio dataset with OpenAI's Whisper
- whisperplus — WhisperPlus: A Python library for WhisperPlus API.
- whisperspeech — An Open Source text-to-speech system built by inverting Whisper
- whisperx — Time-Accurate Automatic Speech Recognition using Whisper.
- whisperx-numpy2-compatibility — A compatibility fix to allow whisperx to work with other packages that require numpy>2. Should no longer be required once pull requests on the main package are accepted.
- whisply — Transcribe, translate, annotate and subtitle audio and video files with OpenAI's Whisper ... fast!
- xares — eXtensive Audio Representation and Evaluation Suite
- xplai — xpl.ai client SDK.
- xron — A deep neural network basecaller for nanopore sequencing.
- xtts-api-server — A simple FastAPI server to host XTTSv2
- xumx-slicq-v2 — V2 of my original sliCQT adaptation of Open-Unmix
- xvector-jtubespeech — Pre-trained model for extracting the x-vector (speaker representation vector)
- yaas — A tool to split video soundtracks into separate tracks using OpenUnmix
- YCSAI — AI Training API developed by SphereAX.
- yolo2onnx-extended — YOLOv8 to ONNX Exporter with Pre and Post Processing
- youtube-discussion-tree-api — This is a Python API that allows you to obtain the discussion that occurs in the comments of a YouTube video as a tree structure.
- yta-general-utils — Youtube Autónomo general utils are here.
- zerospeech-libriabx — Wrapper package for librilight-abx.
- zizou — Python package to detect anomalies in geoscience time series data
- zoobot — Galaxy morphology classifiers
- zorro-pytorch — Zorro - Pytorch