📚 This guide explains how to use Weights & Biases (W&B) with YOLOv5 🚀. UPDATED 29 September 2021.
* About Weights & Biases
* First-Time Setup
* Viewing runs
* Disabling wandb
* Advanced Usage: Dataset Versioning and Evaluation
* Reports: Share your work with the world!
Think of W&B like GitHub for machine learning models. With a few lines of code, save everything you need to debug, compare and reproduce your models — architecture, hyperparameters, git commits, model weights, GPU usage, and even datasets and predictions.
Used by top researchers including teams at OpenAI, Lyft, Github, and MILA, W&B is part of the new standard of best practices for machine learning. How W&B can help you optimize your machine learning workflows:
## Disabling wandb
* training after running wandb disabled
inside that directory creates no wandb run
wandb online
You can leverage W&B artifacts and Tables integration to easily visualize and manage your datasets, models and training evaluations. Here are some quick examples to get you started.
$ python train.py --upload_data val

{dataset}_wandb.yaml
file which can be used to train from dataset artifact.
$ python utils/logger/wandb/log_dataset.py --project ... --name ... --data ..

$ python train.py --data {data}_wandb.yaml

$ python train.py --save_period 1

Any run can be resumed using artifacts if the --resume
argument starts with wandb-artifact://
 prefix followed by the run path, i.e, wandb-artifact://username/project/runid
. This doesn't require the model checkpoint to be present on the local system.
$ python train.py --resume wandb-artifact://{run_path}

Local dataset or model checkpoints are not required. This can be used to resume runs directly on a different device
The syntax is same as the previous section, but you'll need to lof both the dataset and model checkpoints as artifacts, i.e, set bot --upload_dataset
or
train from _wandb.yaml
file and set --save_period
$ python train.py --resume wandb-artifact://{run_path}

W&B Reports can be created from your saved runs for sharing online. Once a report is created you will receive a link you can use to publically share your results. Here is an example report created from the COCO128 tutorial trainings of all four YOLOv5 models (link).
YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):
If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training (train.py), validation (val.py), inference (detect.py) and export (export.py) on MacOS, Windows, and Ubuntu every 24 hours and on every commit.