Reverse Dependencies of pyspark
The following projects have a declared dependency on pyspark:
- pysparkpipe — Flow orchestrator for data transformations within the context of pyspark.sql.GroupedData.applyInPandas
- pysparkplus — Pyspark extra functions!
- PySPARQL — SPARQL Result to Spark
- pysparrow — An arrow interface for PySpark RDDs
- pysparta — Library to help ETL using pyspark
- PyStellarDB — Python interface to StellarDB
- pytalog-spark — no summary
- pytd — Treasure Data Driver for Python
- pytest-dbt-core — Pytest extension for dbt.
- pytest-kuunda — pytest plugin to help with test data setup for PySpark tests
- pytestmsfabric — no summary
- pythonhelpers — A collection of Python tools. So far, just a test.
- pythoth — data profiling monitoring platform
- pytispark — TiSpark support for python
- pyveb — Package containing common code and reusable components for pipelines and dags
- pyzeek — Zeek Analysis Tools
- qsmap — Package provides functionality for working with geographical data, routing, and mapping
- quartic-sdk-gsk — QuarticSDK is the SDK package which exposes the APIs to the user
- qwak-core — Qwak Core contains the necessary objects and communication tools for using the Qwak Platform
- radarpipeline — A python feature generation and visualization package use with RADAR-base project data.
- rain-dm — Rain library.
- rainforest-mch — RandomForest QPE python library
- ratschlab-common — Small library of common functionalities used in various projects in the ratschlab
- raydp — RayDP: Distributed Data Processing on Ray
- raydp-nightly — RayDP: Distributed Data Processing on Ray
- rdsa-utils — A suite of PySpark, Pandas, and general pipeline utils for Reproducible Data Science and Analysis (RDSA) projects.
- reagent — Facebook RL
- recommenders — Recommenders - Python utilities for building recommendation systems
- recon-comp — A recon module to compare two databases in UC
- reina — A Causal Inference library for Big Data.
- replay-rec — RecSys Library
- reprompt — Reprompt
- RetailRecom — Simple Recommendation System in Python3+ (Using Collaborative Filtering)
- rikai — no summary
- rosetta_finder — Searching for Rosetta Binaries.
- RosettaPy — A Python utility for wrapping Rosetta command line tools.
- rovio-ingest — no summary
- rtdip-sdk — no summary
- rtg — Reader Translator Generator(RTG), a Neural Machine Translator(NMT) toolkit based on Pytorch
- s1280247-learn2 — This ssoftware is being developd in the Python Class, University of Aizu.
- s1282003-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- s1282006-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- s1290166-learn — This software is being developed by Hiroyuki Matushima
- s1290229-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- s1290242-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- s1292013-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- s1300180-learn — This software was made for my class
- s1300257-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- s1300259-learn — This software is being developed at the University of Aizu, Aizu-Wakamatsu, Fukushima, Japan
- sachitpkg — no summary
- sagemaker — Open source library for training and deploying models on Amazon SageMaker.
- sagemaker-data-insights — Data Insights Library for Amazon SageMaker.
- SalesData — A PySpark-based sales data processing tool
- salesforce-merlion — Merlion: A Machine Learning Framework for Time Series Intelligence
- sas-merge — replicate SAS merge function using pyspark
- sber-ld-dbtools — Tools for interacting with LD databases.
- scaledp — ScaleDP is a library for processing documents using Apache Spark and LLMs
- schema-jobs — no summary
- schemon-python-client — no summary
- schemon-python-expectation — no summary
- schemon-python-logger — no summary
- scintilla — Scintilla - Generate DataFrames for property based testing
- scope-py — A lens library which targets usability over rigor.
- scystream-sdk — The official SDK for developing scystream compute blocks
- sdk-seshat-python — Seshat python SDK is a library to help create ML data pipelines.
- sdna-conciliacao-core — Utilitários comuns para projetos de conciliação
- seedspark — Spark ETL Utility Framework
- seipy — Helper functions for data science
- sensor-dataset — Put a description
- sentiment-algorithm — This package use newsreader algorithm to produce buy and sell signals, portfolio, ROI and sharpe ratio for stock #AAPL
- sentimeter — Identify the Emotions in a Given Text , Audio file or Live Speech
- sentry-sdk — Python client for Sentry (https://sentry.io)
- sentry-sdk-pubsub — Python client for Sentry (https://sentry.io) with PubSub support
- serra — Simplified Data Pipelines
- sf-hamilton — Hamilton, the micro-framework for creating dataframes.
- sf-hamilton-sdk — Hamilton SDK for reading and writing to the Hamilton backend APIs that support the UI.
- shap — A unified approach to explain the output of any machine learning model.
- shaperone — Shaperone is a fork of the SHAP library, fixing open issues to improve usability.
- shapicant — Feature selection package based on SHAP and target permutation, for pandas and Spark
- shocker — PySpark extension for dataframe keys and relation algebra DAGs
- shparkley — Scaling Shapley Value computation using Spark
- shrike — Python utilities for compliant Azure machine learning
- sigopt — SigOpt Python API Client
- sigopt-spark — SigOpt Pyspark Integration
- sim4rec — Simulator for recommendation algorithms
- simscore — This software is being developed at the Madanapalle Institute of Technology & Science.
- simtools — A set of MapReduce programs to process brain images
- singer-target-iomete — Singer.io target for loading data to iomete
- skyboxremote — A python library for controlling a sky box
- smartnoise-eval — Evaluation of differentially private tabular data
- SnowML — Openweb's ML utility package
- soda-core-spark-df — no summary
- soda-spark — Soda SQL API for PySpark data frame
- spae — Sparked Parallel Aggregation Concentrated Engine
- spalah — Spalah is a set of PySpark dataframe helpers
- sparglim — sparglim
- spark-ai-cli — SparkAI on CLI
- spark-board — Interactive visualization of Spark jobs
- spark-calibration — Calibratiing model scores/probabilites with pyspark dataframes
- spark-column-analyzer — A package for analyzing Spark DataFrame columns