{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Score and Predict Large Datasets\n",
"================================"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Sometimes you'll train on a small dataset that fits in memory, but need to predict or score on a much larger (possibly larger-than-memory) dataset. Perhaps your [learning curve](http://scikit-learn.org/stable/modules/learning_curve.html) has leveled off, so more training data would not improve the model, or you only have labels for a subset of the data.\n",
"\n",
"In this situation, you can use [ParallelPostFit](http://ml.dask.org/modules/generated/dask_ml.wrappers.ParallelPostFit.html) to parallelize and distribute the scoring or prediction steps."
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {
"execution": {
"iopub.execute_input": "2021-01-14T10:49:35.142635Z",
"iopub.status.busy": "2021-01-14T10:49:35.139795Z",
"iopub.status.idle": "2021-01-14T10:49:36.450683Z",
"shell.execute_reply": "2021-01-14T10:49:36.451836Z"
}
},
"outputs": [
{
"data": {
"text/html": [
"