Reverse Dependencies of torchaudio
The following projects have a declared dependency on torchaudio:
- Pyara — Library for audio classification
- pyBibX — A Bibliometric and Scientometric Library Powered with Artificial Intelligence Tools
- pyclarity — Tools for the Clarity Challenge
- pydisconet — analyzing the co-authorship network of researchers in the field of biology
- pyfoal — Python forced aligner
- pygranso-cpu — PyGRANSO: A PyTorch-enabled port of GRANSO with auto-differentiation
- PyIAAS — Code of IAAS Framework
- pykoopman — Python package for data-driven approximations to the Koopman operator.
- pylaia — no summary
- pyncer — yet a captcha library
- pyniche — An AI Library for Niche Squad
- pyprocessors-silero — text repunctuation and recapitalization for
- pyserini-install — A Python toolkit for reproducible information retrieval research with sparse and dense representations
- pystorm3 — Python implementation of some Brainstorm functions
- pythaiasr — Python Thai ASR
- python-auditory-toolbox — Several simple auditory models in JAX, Numpy and Torch
- python-dataset — no summary
- pyting — Data-Driven Shock Capturing
- pytorch-sdk — no summary
- PyTorchLab — Realize code in AI field with PyTorch and Lightning.
- PytorchWildlife — a PyTorch Collaborative Deep Learning Framework for Conservation.
- pyw2v2 — Simple wav2vec2 wrapper
- qualia-core — Qualia toolchain Core
- questionnaire-mistral — no summary
- QuickTune — Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How
- rapidnlp-datasets — Data pipelines for TensorFlow and PyTorch.
- raw-speech-classification — Trains CNN classifiers from raw speech using Keras and tests them.
- rayleaf — RayLEAF: a flexible, highly-scalable benchmark for federated learning
- rctorch — A Python 3 toolset for creating and optimizing Echo State Networks. This library is an extension and expansion of the previous library written by Reinier Maat: https://github.com/1Reinier/Reservoir
- realtime-client — no summary
- RealTimeSTT — A fast Voice Activity Detection and Transcription System
- reputationsystemstest — Dynamic Structural Modeling and Approximate Bayesian Inference of Online Reputation System
- resemble-enhance — Speech denoising and enhancement with deep learning
- retriv — retriv: A Python Search Engine for Humans.
- rev-reverb — A simplified python packge to interact with the reverb models
- reviutils — A common library frequently used on python
- rhizonet-package — Python package for RhizoNet-package
- riffusion — Stable diffusion for real-time music generation.
- rlmc — Python utils for AI 🚀
- rnaformer — RNAformer
- rob-pitch — Robust pitch prediction using PyTorch
- robobo-emotion — LibrerÃa para detectar emociones en imágenes y audio usando Robobo
- rodan — Advanced Deep Learning Library
- roerich — Change point detection.
- roicat — A library for classifying and tracking ROIs.
- rshf — RS pretrained models in huggingface style
- rsp-drl — Some basic functionalities
- rvc-python — Use RVC via console or python scripts
- sagemaker-huggingface-inference-toolkit — Open source library for running inference workload with Hugging Face Deep Learning Containers on Amazon SageMaker.
- samosila-core — no summary
- scAtlasVAE — scAtlasVAE: a deep learning framework for atlas-scale scRNA-seq integration and analysis
- scButterfly — A versatile single-cell cross-modality translation method via dual-aligned variational autoencoders
- scDiffusion — scDiffusion(Single-Cell graph neural Diffusion) is a physics-informed graph generative model to do scRNA-seq analysis. scDiffusion investigates cellular dynamics utilizing an attention-based neural network.
- schp — Package of Self Correction for Human Parsing
- scpram — scPRAM accurately predicts single-cell gene expression perturbation response based on attention mechanism
- scprint — scPRINT is a Large Cell Model for Gene Network Inference, Denoising and more from scRNAseq data
- scslat — A graph deep learning based tool to align single cell spatial omics data
- scUNAGI — A Python package for UNAGI
- sdab — Khmer Speech To Text Inference API using Wav2Vec2 with Pretrain Model
- seacrowd — no summary
- secretflow — SecretFlow
- seismic-classifier — trace by trace seismic classification
- selfeeg — Self-Supervised Learning for EEG
- semilearn — Unfied Semi-Supervised Learning Benchmark
- senselab — Senselab is a Python package that simplifies building pipelines for speech and voice analysis.
- serm — SERM is a high-performance data-driven gene expression recovery framework.
- sgmse — Speech enhancement model using SGMSE
- shapeaxi — Shape Analysis Exploration and Interpretability
- shazbot — Sound Hierarchy Attribute Zeitgeist Before Oligarchy Take
- ShortGPT — Automating video and short content creation with AI
- shttst — Shmart TTS tools.
- silero — Silero Models: pre-trained enterprise-grade STT / TTS models and benchmarks.
- silero-api-server — A simple FastAPI server to host Silero TTS
- silero-tts — Script over the official Silero so that it can be conveniently and quickly used from the code or from the console
- silero-vad — Voice Activity Detector (VAD) by Silero
- silero-vad-fork — A packaged version of the Silero VAD model
- simple-asr — Wrapper module around wav2vec2 designed for ease of use
- simple-diarizer — Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
- singletrader — a package for backtesting and factor analysis
- siste-test — HyperFetch. A tool to optimize and fetch hyperparameters for your reinforcement learning application.
- skeletorch — Pytorch implementation of skeleton transformer module
- slakh-dataset — Unofficial PyTorch dataset for Slakh
- slg-nimrod — minimal deep learning framework
- sliceguard — A library for detecting critical data slices in structured and unstructured data based on features, metadata and model predictions.
- smartdiffusion — A library for making it easier to work with neural networks
- smiles-featurizers — A python library for extracting molecular SMILES embeddings from language models pre-trained with various objectives and/or architectures.
- snngrow — Third-generation Artificial Intelligence SNN Universal Implementation
- so-vits-svc-fork — A fork of so-vits-svc.
- so-vits-svc-fork-mandarin — A mandarin translation version of a fork of so-vits-svc.
- soarv1 — no summary
- socialED — A Python Library for Social Event Detection
- sonar-space — SONAR provides a set of speech and text encoders for multilingual, multimodal semantic embedding.
- sonusai — Framework for building deep neural network models for sound, speech, and voice AI
- Sound-cls — no summary
- sound-scape-explorer — SoundScapeExplorer
- sound-scape-explorer-test — SoundScapeExplorer
- soundstream — Implementation of SoundStream, an end-to-end neural audio codec
- spaceTree — PyPI package for multi-task label transfer from single-cell refrence data to spatial data
- sparseml — Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
- sparseml-nightly — Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models