Reverse Dependencies of evaluate
The following projects have a declared dependency on evaluate:
- 3lc — 3LC Python Package - A tool for model-guided, interactive data debugging and enhancements
- AC-IaC — no summary
- accelerate — Accelerate
- adapters — A Unified Library for Parameter-Efficient and Modular Transfer Learning
- aepo — Annotation Efficient Preference Optimization
- africanwhisper — A framework for fast fine-tuning and API endpoint deployment of Whisper model specifically developed to accelerate Automatic Speech Recognition(ASR) for African Languages.
- agi-med-metrics — Utils for agi-med team metric calculation
- aigpt52 — A Flask-based model training application with authentication.
- aisploit — Tiny package designed to support red teams and penetration testers in exploiting large language model AI solutions.
- al360-trustworthyai-text — SDK API to assess text Machine Learning models.
- alexa-teacher-models — Alexa Teacher Models
- alexandra-ai-eval — Evaluation of finetuned models.
- alignment-handbook — The Alignment Handbook
- ares-ai — ARES is an advanced evaluation framework for Retrieval-Augmented Generation (RAG) systems,
- argilla-v1 — Open-source tool for exploring, labeling, and monitoring data for NLP projects.
- arize — A helper library to interact with Arize AI APIs
- arthur-bench — validate models for production
- assert-llm-tools — Automated Summary Scoring & Evaluation of Retained Text
- AudioAugmentor — Python package for simple application of wide range of audio augmentations.
- autoawq — AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
- autodistill-distilbert — DistilBERT model for use with Autodistill
- autogluon.multimodal — Fast and Accurate ML in 3 Lines of Code
- autogluon-tonyhu-test.multimodal — AutoML for Image, Text, and Tabular Data
- autogoal-transformers — transformers algorithm library wrapper for AutoGOAL
- AutoRAG — Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.
- autotrain-advanced — no summary
- autotransformers — a Python package for automatic training and benchmarking of Language Models.
- axlearn — AXLearn
- axolotl — LLM Trainer
- azureml-metrics — Contains the ML and non-Azure specific common code associated with AzureML metrics.
- bellek — My digital memory
- biochatter — Backend library for conversational AI in biomedicine
- bleuscore — A fast bleu score calculator
- bongovaad — no summary
- cehrbert — CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks
- celi-framework — Controller-Embedded Language Interactions - facilitates the entire lifecycle of document processing, from pre-processing and embedding to post-monitoring and quality assessment.
- chatdesk-grouphug — GroupHug is a library with extensions to 🤗 transformers for multitask language modelling.
- chateval — Evaluation Framework for Chatbots in Generative AI
- clarinpl-embeddings — no summary
- clip-text-decoder — Generate text captions for images from their CLIP embeddings.
- codebook-features — Sparse and discrete interpretability tool for neural networks
- codegaze — Debug code generation models
- compromise-marian — Marian model but with two decoders
- ConsistencyBench — Tools and Techniques for Consistency Benchmarking
- cv-parsing — NLP Application to parse RH Curriculum Vitae for the RH department
- danoliterate — Benchmark of Generative Large Language Models in Danish
- DashAI — DashAI: a graphical toolbox for training, evaluating and deploying state-of-the-art AI models.
- datadreamer.dev — Prompt. Generate Synthetic Data. Train & Align Models.
- dataquality — no summary
- dbgpt-hub — DB-GPT-Hub: Text-to-SQL parsing with LLMs
- deepchopper — A Genomic Language Model for Chimera Artifact Detection in Nanopore Direct RNA Sequencing
- diversity — no summary
- dmx-compressor — d-Matrix Compressor
- docai-py — Butler Doc AI
- domino-code-assist — no summary
- dragon-baseline — Baseline training algorithm for the DRAGON Challenge
- dsbundle — Streamline your data science setup with dsbundle in one effortless install.
- easy-testing — Framework for testing
- eis1600 — EIS1600 project tools and utilities
- elpis — A library to perform automatic speech recognition with huggingface transformers.
- espnet — ESPnet: end-to-end speech processing toolkit
- evaluate-supporter — Evaluate supporter
- evidently — Open-source tools to analyze, monitor, and debug machine learning model in production.
- extralit — Open-source tool for accurate & fast scientific literature data extraction with LLM and human-in-the-loop.
- fastrag — An Efficient Retrieval Augmentation and Generation Framework for Intel Hardware.
- fastrepl — Fast Run-Eval-Polish Loop for LLM App
- fedml — A research and production integrated edge-cloud library for federated/distributed machine learning at anywhere at any scale.
- finetuna — no summary
- finetune-eval-harness — Finetune_Eval_Harness
- finetuning-scheduler — A PyTorch Lightning extension that enhances model experimentation with flexible fine-tuning schedules.
- flax-trainer — Flax Trainer
- flexeval — no summary
- fmeval — Amazon Foundation Model Evaluations
- furiosa-llm-models — Furiosa LLM
- fusionsent — FusionSent: A Fusion-Based Multi-Task Sentence Embedding Model
- future-shot — FutureShot: Few-Shot Learning for high-dimensional classification problems
- genaibook — Utilities for 'Hands-On Generative AI with Transformers and Diffusion Models' (upcoming)
- generate-sequences — no summary
- geniusrise-audio — audio bolts for geniusrise
- geniusrise-text — Text bolts for geniusrise
- geniusrise-vision — Huggingface bolts for geniusrise
- giskard — The testing framework dedicated to ML models, from tabular to LLMs
- GMRev — Librería para evaluar sistemas de generación mejorada por recuperación
- grouphug — GroupHug is a library with extensions to 🤗 transformers for multitask language modelling.
- h2ogpt — no summary
- huggingface-tool — Toolkit for managing huggingface models and datasets
- hyfi-ml — HyFI-ML is a Python package that extends the Hydra Fast Interface (HyFI) framework with machine learning capabilities.
- impetus — An awesome tool/library benchmark LLM performance on all kinds of hardware!
- indomain — no summary
- instruct-qa — Empirical evaluation of retrieval-augmented instruction-following models.
- jac-nlp — no summary
- jaseci-ai-kit — no summary
- jaseci-kit — no summary
- jury — Evaluation toolkit for neural language generation.
- keycare — KeyCARE is a Python library designed for the unsupervised keyword extraction from biomedical documents with the use of different algorithms, the classification of the keywords according to their semantic nature, and the extraction of is a relations among those keywords and with other terminologies.
- koai — Korean AI Project
- kogitune — The Kogitune 🦊 LLM Project
- konfuzio-sdk — Konfuzio Software Development Kit
- langfair — LangFair is a Python library for conducting use-case level LLM bias and fairness assessments
- langkit — A language toolkit for monitoring LLM interactions