Wheelodex — torchaudio — Reverse Dependencies

Reverse Dependencies of torchaudio

The following projects have a declared dependency on torchaudio:

monviso — MoNvIso is a comprehensive software tool designed for the analysis and modeling of protein isoforms. It automates the process of identifying canonical and additional isoforms, assessing their modeling propensity, mapping mutations accurately, and building structural models of proteins.
mosamaticdesktop — Desktop tool for analyzing medical images
mountaintop — make research work more friendly
moveread-pipelines-tatr — Moveread Table Transformer pipeline
MovieChat — Long video understanding
mozyq — no summary
MPoL — Regularized Maximum Likelihood Imaging for Radio Astronomy
mqtts-lightning — Add a short description here
msa-toolbox — MSA Toolbox
msclap — CLAP (Contrastive Language-Audio Pretraining) is a model that learns acoustic concepts from natural language supervision and enables “Zero-Shot” inference. The model has been extensively evaluated in 26 audio downstream tasks achieving SoTA in several of them including classification, retrieval, and captioning.
mu-alpha-zero-library — Library for running and training MuZero and AlphaZero models.
muko — A Chinese-language programming tool for accelerating AIGC and office automation
mushan — Personal toolkit.
musiclm-pytorch — MusicLM - AudioLM + Audio CLIP to text to music synthesis
mvits — VITS toolkit on Pytorch
mw-adapter-transformers — A friendly fork of HuggingFace's Transformers, adding Adapters to PyTorch language models
naeural-core — Naeural Core is the backbone of the Naeural Edge Protocol.
naivenlp-datasets — Data pipelines for TensorFlow and PyTorch.
narizaka — Tool to make high quality text to speech (tts) corpus from audio + text books.
nataili — Nataili: Multimodal AI Python Library
ned — entity linking, named-entity disambiguation, record linkage
neural-homomorphic-vocoder — Pytorch implementation of neural homomorphic vocoder
neural-sync — A library to standardize the usage of various machine learning models
neuralpp — Neural Probabilistic Programs (NeuralPPs)
neurobench — Collaborative, Fair, and Representative Benchmarks for Neuromorphic Computing
neuroimage_denoiser — no summary
neutone-sdk — SDK for wrapping deep learning models for usage in the Neutone plugin
neweraai — NewEraAI - New Era Artificial Intelligence
nlpatl — Natural language processing active learning library for deep neural networks
nn-trainer — Neural Network Trainer
nn-wrapper — A wrapper class for neural networks that makes working with them easier.
nocode-autonn — An AutoML framework for deep learning
NodeCoder — A PyTorch implementation of NodeCoder pipeline, a Graph Convolutional Network (GCN) framework for protein residue characterization.
nomad-audio — Perceptual similarity embeddings for non-matching reference audio quality assessment and speech enhancement
nonebot-plugin-vits-tts — nonebot-plugin-vits-tts
npVCC2016 — npvcc2016: Python loader of npVCC2016 speech corpus
npyx — Python routines dealing with Neuropixels data.
nusacrowd — no summary
NViXTTS — Deep learning for Vietnamese Text to Speech
oceanai — OCEAN-AI
Ocrversion1 — no summary
Ocrversion2 — no summary
oct-tissuemasking — A PyTorch based package for automated OCT tissue masking.
oct-vesselseg — A Label-Free and Data-Free Synthesis Engine and Training Framework for Vascular Segmentation of sOCT Data with PyTorch.
odcommonapp — Object detection common framework application
okwugbe — Automatic Speech Recognition Library for African Languages
OmniSenseVoice — OmniSenseVoice
open-metric-learning — OML is a PyTorch-based framework to train and validate the models producing high-quality embeddings.
openav — OpenAV
openchatkit — OpenChatKit - a powerful, open-source base to create both specialized and general purpose chatbots
openclip-service — CLIP Service Network Application - service part
openpom — Open-source Principal Odor Map models for Olfaction
openretina — Open source retina model architectures and training setups
openunmix — PyTorch-based music source separation toolkit
openvoice-cli — Use OpenVoice 2 stage via console or python scripts
openwakeword — An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity
optimum — Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality.
optimum-deepsparse — Optimum DeepSparse is an extension of the Hugging Face Transformers library that integrates the DeepSparse inference runtime. DeepSparse offers GPU-class performance on CPUs, making it possible to run Transformers and other deep learning models on commodity hardware with sparsity. Optimum DeepSparse provides a framework for developers to easily integrate DeepSparse into their applications, regardless of the hardware platform.
optimum-intel — Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality.
outspeed — no summary
ovos-audio-transformer-plugin-speechbrain-langdetect — A speech lang detection plugin for mycroft
ovos-audio-transformer-plugin-speechbrain-voxlingua107 — A speech lang detection plugin for mycroft
ovos-stt-plugin-wav2vec — A wav2vec stt plugin for OVOS
paddlespeech — Speech tools and models based on Paddlepaddle
pamai — Package for PAMAI written by Arthur Zucker and Chris Rauch.
pandora-llm — Red-teaming large language models for train data leakage
par_yt2text — Extracts metadata about a video, such as the transcript, duration, and comments, with optional audio transcription using OpenAI Whisper.
ParametricSpectralClustering — A library for users to use parametric spectral clustering
ParticleDetection — Tools to track particles with machine learning.
PathwayOracle — An LLM-empowered, KG-Driven, Gene Pathway Analysis Tool
pdf2index — no summary
pegasusX — pegasus - Pytorch
penn — Pitch Estimating Neural Networks (PENN)
personalitylinmult — PersonalityLinMulT: Transformer-based Big Five Automatic Personality Perception.
pesto-pitch — Efficient pitch estimation with self-supervised learning
pgsocr — A command line utility for converting Blu-ray subs to SRT or ASS using AI Language Models.
phasefinder — rotational beat estimation model
phenonaut — A toolkit for multiomic phenotypic space exploration.
phlearn — A package for simulating and learning pseudo-Hamiltonian systems. For further details, see https://arxiv.org/pdf/2206.02660.pdf and https://arxiv.org/abs/2304.14374
photobox — A guizero application to image insect sticky plates
pickpod — Integrated tools to transfer internet audio to text, extract unpopular views, and pick up podcasts for you.
pipepal — PipePal is a Python package that simplifies building pipelines for speech and voice analysis.
pixano-inference — Inference models for Pixano, data-centric AI building blocks for computer vision applications
PLAID-X — Efficient and Effective Passage Search via Contextualized Late Interaction over BERT and XLM-RoBERTa
plixkws — Plug-and-Play Multilingual Few-shot Spoken Words Recognition
podgrab — CLI tool for downloading and transcribing podcasts.
polaritymodel — A package for running the cell polarity model
porthamiltonians — A package for simulating and learning port-Hamiltonian systems. For further details, see https://arxiv.org/pdf/2206.02660.pdf
ppgs — Phonetic posteriorgrams
promonet — Prosody Modification Network
prompt-hyperopt — Reliable language-model prompts through templates, calibration, and hyperparameter optimization
provision-ai — AI experiment provisioner
psauron — A tool to assess protein coding gene annotation
psifx — Psychological and Social Interactions Feature Extraction
pteredactyl — Pteredactyl performs free-text redaction and masking of personally identifiable information (PII) in clinical free text. It can be deployed as an API from a container or as a Python module.
pva-resimagenet — NN based on U-Net and DenseNet for image restoration
pva-resimagenet-app — An application that uses ResImageNet to restore old photos.
py-awesome-scripts — Awesome python script tools.
py-data-juicer — A One-Stop Data Processing System for Large Language Models.
pyannote.audio — Neural building blocks for speaker diarization
