optimus

Agile Data Science Workflows made easy with PySpark.

What You'll Learn

Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark.

Technologies & Topics
big-data-cleaning
bigdata
cudf
dask
dask-cudf
data-analysis
data-cleaner
data-cleaning
data-cleansing
data-exploration
data-extraction
data-preparation
data-profiling
data-science
data-transformation
data-wrangling
machine-learning
pyspark
spark
Getting Started

1. Clone the Repository

git clone https://github.com/hi-primus/optimus.git

2. Follow the README

Check the repository's README file for specific setup instructions, dependencies, and usage examples.

3. Explore and Learn

Study the code structure, run the examples, and experiment with modifications to deepen your understanding.
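
As a starting point for studying the code structure, the short script below counts Python files under each top-level directory of a local clone. It is a minimal sketch that uses only the standard library and does not call any Optimus API. The helper name `summarize_python_files` is hypothetical, and the script assumes the clone from step 1 lives in a directory named `optimus`.

```python
from collections import Counter
from pathlib import Path


def summarize_python_files(repo_root):
    """Count .py files under each top-level entry of repo_root."""
    root = Path(repo_root)
    counts = Counter()
    if not root.is_dir():
        # Nothing to scan if the clone is not where we expect it.
        return {}
    for path in root.rglob("*.py"):
        relative = path.relative_to(root)
        # Skip hidden files and anything inside hidden directories such as .git.
        if any(part.startswith(".") for part in relative.parts):
            continue
        # Files directly at the repository root are grouped under ".".
        top = relative.parts[0] if len(relative.parts) > 1 else "."
        counts[top] += 1
    return dict(counts)


if __name__ == "__main__":
    # Assumption: the repository was cloned into ./optimus.
    for name, count in sorted(summarize_python_files("optimus").items()):
        print(f"{name}: {count}")
```

Directories with the most files are usually a reasonable place to begin reading, alongside whatever entry points the README names.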

Repository Info
Language: Python
Stars: 1,512
Published: 7/13/2017
Category: Python