Training a LeNet for Image Classification with TorchUncertainty#

In this tutorial, we will train a LeNet classifier on the MNIST dataset using TorchUncertainty. Along the way, you will discover two of its core tools:

  • the routine: a model wrapper that handles the training and evaluation logic, here for classification

  • the datamodules: Python classes that provide the dataloaders used by the routine (a schematic of how the two fit together follows this list)
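
In code, the pattern looks like this (a comment-only schematic of the steps detailed below, not a runnable cell on its own):

# The pattern this tutorial follows:
#   routine    -> wraps the model and defines the training/evaluation logic
#   datamodule -> provides the train/val/test dataloaders and transforms
#   trainer    -> orchestrates the loops and the hardware
#
# trainer.fit(model=routine, datamodule=datamodule)
# trainer.test(model=routine, datamodule=datamodule)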

1. Loading the utilities#

First, we have to load the following utilities from TorchUncertainty:

  • the TUTrainer, which mostly handles the link with the hardware (accelerators, precision, etc.)

  • the classification training & evaluation routine from torch_uncertainty.routines

  • the datamodule handling dataloaders: MNISTDataModule from torch_uncertainty.datamodules

  • the model: lenet from torch_uncertainty.models

  • an optimization recipe in the torch_uncertainty.optim_recipes module.

from pathlib import Path

from torch import nn

from torch_uncertainty import TUTrainer
from torch_uncertainty.datamodules import MNISTDataModule
from torch_uncertainty.models.classification.lenet import lenet
from torch_uncertainty.optim_recipes import optim_cifar10_resnet18
from torch_uncertainty.routines import ClassificationRoutine

2. Creating the Trainer and the DataModule#

In the following, we first create the trainer and then instantiate the datamodule that handles the MNIST dataset, its dataloaders, and the transforms.

trainer = TUTrainer(accelerator="gpu", devices=1, max_epochs=2, enable_progress_bar=False)

# datamodule providing the dataloaders to the trainer
root = Path("data")
datamodule = MNISTDataModule(root=root, batch_size=128)
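
If you do not have a GPU at hand, the trainer can run on CPU instead; a minimal variant of the cell above, assuming TUTrainer forwards the usual Lightning accelerator arguments:

# CPU fallback (sketch): same trainer, no GPU required
cpu_trainer = TUTrainer(accelerator="cpu", max_epochs=2, enable_progress_bar=False)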

3. Instantiating the Model#

We create the model easily using the lenet blueprint from torch_uncertainty.models.

model = lenet(
    in_channels=datamodule.num_channels,
    num_classes=datamodule.num_classes,
    dropout_rate=0.4,
)
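
Since the datamodule exposes the dataset metadata used above, a quick sanity check (illustrative, not part of the original recipe) can confirm the model is wired for MNIST:

# Count trainable parameters and confirm the number of classes
num_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable parameters: {num_params}")
print(f"Classes expected by the datamodule: {datamodule.num_classes}")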

4. The Loss and the Routine#

This is a classification problem, so we use CrossEntropyLoss as the negative log-likelihood. We define the training routine using the classification routine from torch_uncertainty.routines. We provide the number of classes, the model, the optimization recipe, and the loss, and we tell the routine to treat our model as an ensemble at evaluation time via the is_ensemble flag.

routine = ClassificationRoutine(
    num_classes=datamodule.num_classes,
    model=model,
    loss=nn.CrossEntropyLoss(),
    optim_recipe=optim_cifar10_resnet18(model),
    is_ensemble=True,
)
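
The CIFAR-10 ResNet-18 recipe used above is just a convenient preset. If you prefer to control the optimization yourself, a plain optimizer should work too; a sketch, under the assumption that optim_recipe also accepts a standard torch.optim optimizer:

from torch import optim

# Hand-rolled alternative to optim_cifar10_resnet18 (assumption:
# optim_recipe also accepts a plain torch.optim optimizer)
custom_routine = ClassificationRoutine(
    num_classes=datamodule.num_classes,
    model=model,
    loss=nn.CrossEntropyLoss(),
    optim_recipe=optim.Adam(model.parameters(), lr=1e-3, weight_decay=5e-4),
    is_ensemble=True,
)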

5. Gathering Everything and Training the Model#

We can now train the model using the trainer. We pass the routine and the datamodule to the fit and test methods of the trainer, which will automatically evaluate the uncertainty metrics that you will find in the tables below.

trainer.fit(model=routine, datamodule=datamodule)
results = trainer.test(model=routine, datamodule=datamodule)
(Output: progress bars for the MNIST download, omitted for brevity.)
(Lightning also warns that the train, validation, and test dataloaders use few workers, and suggests increasing num_workers in the DataLoader to improve performance.)
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Test metric  ┃      Classification       ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│     Acc      │          91.250%          │
│    Brier     │          0.17158          │
│   Entropy    │          0.76700          │
│     NLL      │          0.40518          │
└──────────────┴───────────────────────────┘
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Test metric  ┃        Calibration        ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│     ECE      │          14.312%          │
│     aECE     │          14.312%          │
└──────────────┴───────────────────────────┘
┏━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
┃ Test metric  ┃ Selective Classification  ┃
┡━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━┩
│    AUGRC     │          1.181%           │
│     AURC     │          1.478%           │
│  Cov@5Risk   │          91.280%          │
│  Risk@80Cov  │          2.338%           │
└──────────────┴───────────────────────────┘
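
The same numbers are available programmatically: trainer.test returns a list with one dictionary of metrics per test dataloader. The exact key names are easiest to discover by printing them, as sketched below:

# `results` holds one dict of metric names to values per test dataloader;
# the exact key names (e.g. possible "test/" prefixes) are best inspected
# rather than guessed.
test_metrics = results[0]
print(sorted(test_metrics.keys()))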

6. Evaluating the Model#

Now that the model is trained, let’s look at its predictions on a few MNIST images.

import matplotlib.pyplot as plt
import numpy as np
import torch
import torchvision


def imshow(img) -> None:
    npimg = img.numpy()
    npimg = npimg * 0.3081 + 0.1307  # unnormalize with the MNIST std (0.3081) and mean (0.1307)
    plt.imshow(np.transpose(npimg, (1, 2, 0)))
    plt.axis("off")
    plt.tight_layout()
    plt.show()


images, labels = next(iter(datamodule.val_dataloader()))

# print images
imshow(torchvision.utils.make_grid(images[:6, ...], padding=0))
print("Ground truth labels: ", " ".join(f"{labels[j]}" for j in range(6)))

routine.eval()
logits = routine(images)

probs = torch.nn.functional.softmax(logits, dim=-1)

values, predicted = torch.max(probs, 1)
print(
    "LeNet predictions for the first 6 images: ",
    " ".join([str(image_id.item()) for image_id in predicted[:6]]),
)
[Figure: grid of the first six MNIST images]
Ground truth labels:  7 2 1 0 4 1
LeNet predictions for the first 6 images:  7 2 1 0 4 1
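
The maximum softmax probability stored in values doubles as a simple per-image confidence score, linking these predictions back to the calibration metrics reported earlier:

# `values` holds the max softmax probability per image: a basic
# confidence score (closer to 1 = more confident prediction)
print("Confidence: ", " ".join(f"{v.item():.3f}" for v in values[:6]))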
