Reverse Dependencies of torchaudio
The following projects have a declared dependency on torchaudio:
- monviso — MoNvIso is a comprehensive software tool designed for the analysis and modeling of protein isoforms. It automates the process of identifying canonical and additional isoforms, assessing their modeling propensity, mapping mutations accurately, and building structural models of proteins.
- mosamaticdesktop — Desktop tool for analyzing medical images
- mountaintop — make research work more friendly
- moveread-pipelines-tatr — Moveread Table Transformer pipeline
- MovieChat — Long video understanding
- mozyq — no summary
- MPoL — Regularized Maximum Likelihood Imaging for Radio Astronomy
- mqtts-lightning — Add a short description here
- msa-toolbox — MSA Toolbox
- msclap — CLAP (Contrastive Language-Audio Pretraining) is a model that learns acoustic concepts from natural language supervision and enables “Zero-Shot” inference. The model has been extensively evaluated in 26 audio downstream tasks achieving SoTA in several of them including classification, retrieval, and captioning.
- mu-alpha-zero-library — Library for running and training MuZero and AlphaZero models.
- muko — 加速实现AIGC、自动办公的中文编程工具
- mushan — Personal toolkit.
- musiclm-pytorch — MusicLM - AudioLM + Audio CLIP to text to music synthesis
- mvits — VITS toolkit on Pytorch
- mw-adapter-transformers — A friendly fork of HuggingFace's Transformers, adding Adapters to PyTorch language models
- naeural-core — Naeural Core is the backbone of the Naeural Edge Protocol.
- naivenlp-datasets — Data pipelines for TensorFlow and PyTorch.
- narizaka — Tool to make high quality text to speech (tts) corpus from audio + text books.
- nataili — Nataili: Multimodal AI Python Library
- ned — entity linking, named-entity disambiguation, record linkage
- neural-homomorphic-vocoder — Pytorch implementation of neural homomorphic vocoder
- neural-sync — A library to standardize the usage of various machine learning models
- neuralpp — Neural Probabilistic Programs (NeuralPPs)
- neurobench — Collaborative, Fair, and Representative Benchmarks for Neuromorphic Computing
- neuroimage_denoiser — no summary
- neutone-sdk — SDK for wrapping deep learning models for usage in the Neutone plugin
- neweraai — NewEraAI - New Era Artificial Intelligence
- nlpatl — Natural language processing active learning library for deep neural networks
- nn-trainer — Neural Network Trainer
- nn-wrapper — A wrapper class for neural networks that makes working with them easier.
- nocode-autonn — An AutoML framework for deep learning
- NodeCoder — A PyTorch implementation of NodeCoder pipeline, a Graph Convolutional Network (GCN) framework for protein residue characterization.
- nomad-audio — Perceptual similarity embeddings for non-matching reference audio quality assessment and speech enhancement
- nonebot-plugin-vits-tts — nonebot-plugin-vits-tts
- npVCC2016 — npvcc2016: Python loader of npVCC2016 speech corpus
- npyx — Python routines dealing with Neuropixels data.
- nusacrowd — no summary
- NViXTTS — Deep learning for Vietnamese Text to Speech
- oceanai — OCEAN-AI
- Ocrversion1 — no summary
- Ocrversion2 — no summary
- oct-tissuemasking — A PyTorch based package for automated OCT tissue masking.
- oct-vesselseg — A Label-Free and Data-Free Synthesis Engine and Training Framework for Vascular Segmentation of sOCT Data with PyTorch.
- odcommonapp — Object detection common framework application
- okwugbe — Automatic Speech Recognition Library for African Languages
- OmniSenseVoice — OmniSenseVoice
- open-metric-learning — OML is a PyTorch-based framework to train and validate the models producing high-quality embeddings.
- openav — OpenAV
- openchatkit — OpenChatKit - a powerful, open-source base to create both specialized and general purpose chatbots
- openclip-service — CLIP Service Network Application - service part
- openpom — Open-source Principal Odor Map models for Olfaction
- openretina — Open source retina model architectures and training setups
- openunmix — PyTorch-based music source separation toolkit
- openvoice-cli — Use OpenVoice 2 stage via console or python scripts
- openwakeword — An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity
- optimum — Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality.
- optimum-deepsparse — Optimum DeepSparse is an extension of the Hugging Face Transformers library that integrates the DeepSparse inference runtime. DeepSparse offers GPU-class performance on CPUs, making it possible to run Transformers and other deep learning models on commodity hardware with sparsity. Optimum DeepSparse provides a framework for developers to easily integrate DeepSparse into their applications, regardless of the hardware platform.
- optimum-intel — Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality.
- outspeed — no summary
- ovos-audio-transformer-plugin-speechbrain-langdetect — A speech lang detection plugin for mycroft
- ovos-audio-transformer-plugin-speechbrain-voxlingua107 — A speech lang detection plugin for mycroft
- ovos-stt-plugin-wav2vec — A wav2vec stt plugin for OVOS
- paddlespeech — Speech tools and models based on Paddlepaddle
- pamai — Package for PAMAI written by Arthur Zucker and Chris Rauch.
- pandora-llm — Red-teaming large language models for train data leakage
- par_yt2text — Extracts metadata about a video, such as the transcript, duration, and comments, with optional audio transcription using OpenAI Whisper.
- ParametricSpectralClustering — A library for users to use parametric spectral clustering
- ParticleDetection — Tools to track particles with machine learning.
- PathwayOracle — An LLM-empowered, KG-Driven, Gene Pathway Analysis Tool
- pdf2index — no summary
- pegasusX — pegasus - Pytorch
- penn — Pitch Estimating Neural Networks (PENN)
- personalitylinmult — PersonalityLinMulT: Transformer-based Big Five Automatic Personality Perception.
- pesto-pitch — Efficient pitch estimation with self-supervised learning
- pgsocr — A command line utility for converting Blu-ray subs to SRT or ASS using AI Language Models.
- phasefinder — rotational beat estimation model
- phenonaut — A toolkit for multiomic phenotypic space exploration.
- phlearn — A package for simulating and learning pseudo-Hamiltonian systems. For further details, see https://arxiv.org/pdf/2206.02660.pdf and https://arxiv.org/abs/2304.14374
- photobox — A guizero application to image insect sticky plates
- pickpod — Integrated tools to transfer internet audio to text, extract unpopular views, and pick up podcasts for you.
- pipepal — PipePal is a Python package that simplifies building pipelines for speech and voice analysis.
- pixano-inference — Inference models for Pixano, data-centric AI building blocks for computer vision applications
- PLAID-X — Efficient and Effective Passage Search via Contextualized Late Interaction over BERT and XLM-RoBERTa
- plixkws — Plug-and-Play Multilingual Few-shot Spoken Words Recognition
- podgrab — CLI tool for downloading and transcribing podcasts.
- polaritymodel — A package for running the cell polarity model
- porthamiltonians — A package for simulating and learning port-Hamiltonian systems. For further details, see https://arxiv.org/pdf/2206.02660.pdf
- ppgs — Phonetic posteriorgrams
- promonet — Prosody Modification Network
- prompt-hyperopt — Reliable language-model prompts through templates, calibration, and hyperparameter optimization
- provision-ai — AI experiment provisioner
- psauron — A tool to assess protein coding gene annotation
- psifx — Psychological and Social Interactions Feature Extraction
- pteredactyl — Pteredactyl performs free-text redaction and masking of peronally identifiable information (PII) in clinical free text. It can be deployed as an API from a container or as a python module
- pva-resimagenet — NN based on U-Net and DenseNet for image restoration
- pva-resimagenet-app — An application that uses ResImageNet to restore old photos.
- py-awesome-scripts — Awesome python script tools.
- py-data-juicer — A One-Stop Data Processing System for Large Language Models.
- pyannote.audio — Neural building blocks for speaker diarization