Workflow Examples#
Scaling up Hyperparameter Optimization with Kubernetes and XGBoost GPU Algorithm
library/xgboost library/optuna library/dask tools/dask-kubernetes library/scikit-learn workflow/hpo platforms/kubeflow platforms/kubernetes
Scaling up Hyperparameter Optimization with Multi-GPU Workload on Kubernetes
library/xgboost library/optuna library/dask library/dask-kubernetes library/scikit-learn workflow/hpo dataset/nyc-taxi data-storage/gcs data-format/csv platforms/kubeflow platforms/kubernetes
Getting Started with Optuna and RAPIDS for HPO
library/optuna library/dask workflow/hpo library/cuml library/numpy dataset/bnp-claims
Running RAPIDS Hyperparameter Experiments at Scale on Amazon SageMaker
cloud/aws/sagemaker workflow/hpo library/cudf library/cuml library/scikit-learn data-format/csv data-storage/s3
Deep Dive into Running Hyper Parameter Optimization on AWS SageMaker
cloud/aws/sagemaker workflow/hpo library/xgboost library/cuml library/cupy library/cudf library/dask data-storage/s3 data-format/parquet
Multi-node Multi-GPU Example on AWS using dask-cloudprovider
cloud/aws/ec2-multi library/cuml library/dask library/numpy library/dask-ml library/cudf workflow/randomforest tools/dask-cloudprovider data-format/csv data-storage/gcs
Autoscaling Multi-Tenant Kubernetes Deep-Dive
cloud/gcp/gke tools/dask-operator library/cuspatial library/dask library/cudf data-format/parquet data-storage/gcs platforms/kubernetes
HPO with dask-ml and cuml
dataset/airline library/numpy library/pandas library/xgboost library/dask library/dask-cuda library/dask-ml library/cuml cloud/aws/ec2 cloud/azure/azure-vm cloud/gcp/compute-engine cloud/ibm/virtual-server library/sklearn data-storage/s3 workflow/hpo
Train and Hyperparameter-Tune with RAPIDS on AzureML
cloud/azure/ml library/cudf library/cuml library/randomforest workflow/hpo
Perform Time Series Forecasting on Google Kubernetes Engine with NVIDIA GPUs
cloud/gcp/gke tools/dask-operator workflow/hpo workflow/xgboost library/dask library/dask-cuda library/xgboost library/optuna data-storage/gcs platforms/kubernetes
HPO Benchmarking with RAPIDS and Dask
cloud/aws/ec2 data-storage/s3 workflow/randomforest workflow/hpo workflow/xgboost library/dask library/dask-cuda library/xgboost library/optuna library/sklearn library/dask-ml
Training XGBoost with Dask RAPIDS in Databricks
library/dask library/dask-cudf library/xgboost library/dask-deltatable library/dask-databricks library/dask-ml workflow/xgboost dataset/higgs data-format/csv data-storage/databricks-delta-lake platforms/databricks
Multi-Node Multi-GPU XGBoost Example on Azure using dask-cloudprovider
cloud/azure/azure-vm-multi tools/dask-cloudprovider library/cudf library/cuml library/xgboost library/dask library/fil data-storage/azure-data-lake dataset/nyc-taxi workflow/xgboost
Measuring Performance with the One Billion Row Challenge
tools/dask-cuda data-format/csv library/cudf library/cupy library/dask library/pandas cloud/aws/ec2 cloud/aws/sagemaker cloud/azure/azure-vm cloud/azure/ml cloud/gcp/compute-engine cloud/gcp/vertex-ai
Getting Started with cudf.pandas and Snowflake
library/cudf platforms/snowflake