dask-distributed

Here are 40 public repositories matching this topic...

DataCanvasIO / HyperGBM

A full pipeline AutoML tool for tabular data

sklearn tabular-data xgboost semi-supervised-learning gpu-acceleration gbm lightgbm ensemble-learning dask preprocessing automl distributed-training datacleaning catboost pseudo-labeling dask-distributed rapidsai fullpipeline adversarial-validation

Updated Feb 28, 2024
Python

shauryashaurya / learn-data-munging

Star

Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.

spark jupyter arrow numpy pandas pyspark data-engineering dask ray dask-distributed datafusion polars

Updated Apr 26, 2024
Jupyter Notebook

TimeEval / TimeEval

Star

Evaluation Tool for Anomaly Detection Algorithms on Time Series

benchmarking time-series numpy pandas python3 distributed dask benchmark-framework jupyter-notebooks time-series-analysis anomaly-detection dask-distributed time-series-anomaly-detection

Updated May 22, 2024
Jupyter Notebook

JSybrandt / agatha

Star

AGATHA: Automatic Graph-mining And Transformer based Hypothesis generation Approach

redis spacy dask bert moliere hypothesis-generation dask-distributed scispacy scibert

Updated Jun 17, 2020
Python

modin-project / unidist

Star

Unified Distributed Execution

python multiprocessing mpi distributed ray dask-distributed

Updated May 29, 2024
Python

sulis-hpc / sulis-hpc.github.io

Star

User documentation website for the Sulis tier 2 HPC service. Built using Jekyll.

python c r hpc openmp mpi parallel-computing slurm uk mpi4py fortran90 warwickuni ensemble-methods dask-distributed

Updated May 16, 2024
SCSS

jameslamb / lightgbm-dask-testing

Star

Test LightGBM's Dask integration on different cluster types

docker aws machine-learning lightgbm dask dask-distributed

Updated Jun 19, 2023
Jupyter Notebook

gandalf1819 / NYCOpenData-Profiling-Analysis

Star

Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex

big-data pandas pyspark levenshtein-distance hdfs dask regular-expressions fuzzywuzzy fuzzy-logic data-profiling nyc-opendata modin nyc-311-dataset dask-distributed

Updated Nov 10, 2020
Jupyter Notebook

leosmerling-hopeit / fraud-poc

Star

Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/

machine-learning microservices dask fraud-detection dask-ml dask-distributed nbdev ml-pipelines

Updated Apr 12, 2023
Jupyter Notebook

epiviz / epivizFileServer

Star

Python library to query and transform genomic data from indexed files

api webservice genomics python3 file-server genomic-data-analysis dask-distributed

Updated Aug 6, 2022
Python

gdmarmerola / big-data-ml-training

Star

Code for "Training models when data doesn't fit in memory" post

machine-learning dask dask-ml dask-distributed ml-engineering

Updated Jun 14, 2020
Jupyter Notebook

pyiron / pylammpsmpi

Star

Parallel Lammps Python interface - control a mpi4py parallel LAMMPS instance from a serial python process or a Jupyter notebook

lammps openmpi mpi4py lammps-python-interface dask-distributed

Updated Jun 2, 2024
Python

aws-solutions-library-samples / distributed-compute-on-aws-with-cross-regional-dask

Star

Perform I/O intensive workloads on high-volume data sparsely located across multiple AWS regions through the use of Dask.

dask dask-distributed dask-worker-pools

Updated Oct 23, 2023
TypeScript

elcorto / psweep

Star

Loop like a pro, make parameter studies fun.

python database pandas parameter-estimation dask parameter-search parameter-sweep parameter-scan dask-jobqueue dask-distributed parameter-study computational-experiment

Updated May 14, 2024
Python

pleiszenburg / scherbelberg

Star

HPC cluster deployment and management for the Hetzner Cloud

python cloud deployment hpc high-performance cluster management high-performance-computing dask hpc-clusters hetzner hpc-cluster cluster-management dask-distributed

Updated Feb 11, 2022
Python

comp-dev-cms-ita / dask-remote-jobqueue

Star

A custom dask remote jobqueue for HTCondor.

htcondor dask dask-jobqueue dask-distributed

Updated Feb 29, 2024
Python

LimnoTech / Xarray-DataAccessor

Star

Efficiently read climate/meteorology data into Xarray using Dask for parallelization. Transform the data for your modelling needs.

data-engineering meteorology climate-data dask-distributed xarray-accessor

Updated Apr 3, 2024
Python

lebedov / dask-ml-on-azure-ml

Star

Using Dask-ML on Azure ML

azure-ml dask-ml dask-distributed

Updated Nov 21, 2019
Python

eth-cscs / ipcluster_magic

Star

Magic commands to support running MPI python code as well as multi-node Dask workloads on Jupyter notebooks.

jupyter-notebook ipyparallel mpi4py dask-distributed

Updated Feb 24, 2023
Python

IncubatorShokuhou / dask-tutorial-chinese

Star

Dask tutorial；Dask汉化教程

chinese-translation dask delayed dask-ml dask-distributed dask-array dask-dataframes

Updated Jan 13, 2022
Jupyter Notebook

Improve this page

Add a description, image, and links to the dask-distributed topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dask-distributed topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dask-distributed

Here are 40 public repositories matching this topic...

DataCanvasIO / HyperGBM

shauryashaurya / learn-data-munging

TimeEval / TimeEval

JSybrandt / agatha

modin-project / unidist

sulis-hpc / sulis-hpc.github.io

jameslamb / lightgbm-dask-testing

gandalf1819 / NYCOpenData-Profiling-Analysis

leosmerling-hopeit / fraud-poc

epiviz / epivizFileServer

gdmarmerola / big-data-ml-training

pyiron / pylammpsmpi

aws-solutions-library-samples / distributed-compute-on-aws-with-cross-regional-dask

elcorto / psweep

pleiszenburg / scherbelberg

comp-dev-cms-ita / dask-remote-jobqueue

LimnoTech / Xarray-DataAccessor

lebedov / dask-ml-on-azure-ml

eth-cscs / ipcluster_magic

IncubatorShokuhou / dask-tutorial-chinese

Improve this page

Add this topic to your repo