Reverse Dependencies of Resiliparse
The following projects declare a dependency on Resiliparse:
- haruka-parser — A simple HTML Parser
- invisible-rabbit — Scalable Data Preprocessing Tool for Training Large Language Models
- invisible-unicorn — Scalable Data Preprocessing Tool for Training Large Language Models
- nemo-curator — Scalable Data Preprocessing Tool for Training Large Language Models
- openwebmath-text-extract — Text Extractor from OpenWebMath
- Resiliparse — A collection of robust and fast processing tools for parsing and analyzing (not only) web archive data