101 Train Decision Transformers

Training Decision Transformers with 🤗 transformers

In this tutorial, you'll learn to train your first Offline Decision Transformer model from scratch to make a half-cheetah run. 🏃

โ“ If you have questions, please post them on #study-group discord channel ๐Ÿ‘‰ https://discord.gg/aYka4Yhff9

🎮 Environments:

โฌ‡๏ธ Here's what you'll achieve at the end of this tutorial. โฌ‡๏ธ


Prerequisites 🏗️

Before diving into the notebook, you need to:

🔲 📚 Read the tutorial

Step 0: Set the GPU 💪

  • To speed up the agent's training, we'll use a GPU. To do that, go to Runtime > Change Runtime type
  • Hardware Accelerator > GPU
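Once the runtime is set, a quick sanity check (assuming PyTorch is installed, which the later steps need anyway) confirms the GPU is visible:

```python
import torch

# Pick the GPU when one is available, otherwise fall back to the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
print(f"Using device: {device}")
```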

Step 1: Install dependencies for model evaluation 🔽

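The original cell contents aren't shown here. As a sketch, evaluating a MuJoCo agent in Colab typically needs system packages for headless video rendering plus a MuJoCo binding; the package names below are assumptions, not the notebook's exact cell:

```shell
# Headless rendering support for recording evaluation videos (assumed).
apt-get install -y xvfb ffmpeg
# MuJoCo binding and gym environment used by the half-cheetah task (assumed).
pip install free-mujoco-py gym colabgymrender
```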

Step 2: Install and import the packages 📦


Step 3: Loading the dataset from the 🤗 Hub and instantiating the model

We host a number of offline RL datasets on the 🤗 Hub. Today we will be training with the halfcheetah "expert" dataset, hosted here on the Hub.

First we need to import the load_dataset function from the 🤗 datasets package and download the dataset to our machine.


Step 4: Defining a custom DataCollator for the transformers Trainer class

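The collator used in the blog post samples subsequences from the trajectories, pads them to a fixed context length, and computes returns-to-go. A simplified, self-contained sketch of those mechanics (function names and the `max_len` context are illustrative, not the notebook's exact code):

```python
import numpy as np
import torch

def discount_cumsum(x, gamma=1.0):
    # Returns-to-go: rtg[t] = x[t] + gamma * rtg[t + 1].
    out = np.zeros_like(x)
    out[-1] = x[-1]
    for t in reversed(range(len(x) - 1)):
        out[t] = x[t] + gamma * out[t + 1]
    return out

def collate_trajectory(obs, acts, rews, max_len=20):
    # Keep the last `max_len` steps and left-pad shorter sequences with
    # zeros, mirroring the Decision Transformer padding scheme.
    T = len(obs)
    start = max(0, T - max_len)
    s, a = obs[start:], acts[start:]
    rtg = discount_cumsum(rews)[start:]
    pad = max_len - (T - start)
    mask = np.concatenate([np.zeros(pad), np.ones(T - start)])
    s = np.concatenate([np.zeros((pad, s.shape[1])), s])
    a = np.concatenate([np.zeros((pad, a.shape[1])), a])
    rtg = np.concatenate([np.zeros(pad), rtg])
    return {
        "states": torch.tensor(s, dtype=torch.float32),
        "actions": torch.tensor(a, dtype=torch.float32),
        "returns_to_go": torch.tensor(rtg, dtype=torch.float32).unsqueeze(-1),
        "attention_mask": torch.tensor(mask, dtype=torch.float32),
    }
```

The real collator batches many such subsequences and also returns timesteps and rewards, but the padding and returns-to-go logic is the core idea.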

Step 5: Extending the Decision Transformer Model to include a loss function

In order to train the model with the 🤗 Trainer class, we first need to ensure that the dictionary it returns contains a loss: in this case, the L2 norm of the difference between the model's action predictions and the targets.

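A sketch of wrapping the model so its forward pass also returns that L2 loss; the class name and masking details follow the pattern described above, but treat it as illustrative rather than the notebook's exact cell:

```python
import torch
from transformers import DecisionTransformerModel

class TrainableDT(DecisionTransformerModel):
    """DecisionTransformerModel whose forward output dict includes a loss."""

    def forward(self, **kwargs):
        output = super().forward(**kwargs)
        action_preds = output[1]            # predicted actions
        action_targets = kwargs["actions"]  # ground-truth actions
        attention_mask = kwargs["attention_mask"]
        act_dim = action_preds.shape[2]
        # Flatten and keep only the non-padded timesteps.
        mask = attention_mask.reshape(-1) > 0
        action_preds = action_preds.reshape(-1, act_dim)[mask]
        action_targets = action_targets.reshape(-1, act_dim)[mask]
        # L2 (mean squared error) between predicted and target actions.
        loss = torch.mean((action_preds - action_targets) ** 2)
        return {"loss": loss}
```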

Step 6: Defining the training hyperparameters and training the model

Here, we define the training hyperparameters and our Trainer class that we'll use to train our Decision Transformer model.

This step takes about an hour, so you may leave it running. Note that the authors train for at least 3 hours, so the results presented here are not as performant as the models hosted on the 🤗 Hub.

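A sketch of the training setup, wrapped in a helper so the pieces from the earlier steps (model, dataset, collator) stay explicit; the hyperparameter values here are illustrative, not the exact ones from the original cell:

```python
from transformers import Trainer, TrainingArguments

def build_trainer(model, train_dataset, data_collator):
    # Hyperparameters are assumptions; see the original notebook for the
    # values actually used in the blog post.
    args = TrainingArguments(
        output_dir="output/",
        remove_unused_columns=False,  # the collator returns custom columns
        num_train_epochs=120,
        per_device_train_batch_size=64,
        learning_rate=1e-4,
        weight_decay=1e-4,
        warmup_ratio=0.1,
        max_grad_norm=0.25,
    )
    return Trainer(model=model, args=args,
                   train_dataset=train_dataset, data_collator=data_collator)

# trainer = build_trainer(model, dataset["train"], collator)
# trainer.train()
```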

Step 7: Visualize the performance of the agent

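Evaluation rolls the policy out in the MuJoCo environment while conditioning on a target return that is reduced by each observed reward. A minimal sketch of that bookkeeping, with the environment and model behind placeholder callables (all names here are illustrative):

```python
import numpy as np

def evaluate_episode(env_step, policy, state_dim, act_dim,
                     target_return=12.0, scale=1000.0, max_steps=1000):
    """Roll out `policy` while tracking states, actions and returns-to-go.

    `env_step(action) -> (next_state, reward, done)` stands in for the gym
    environment; `policy(states, actions, returns_to_go, timesteps) -> action`
    stands in for the trained Decision Transformer.
    """
    states = [np.zeros(state_dim)]
    actions, rewards = [], []
    rtgs = [target_return]
    for t in range(max_steps):
        # The model predicts the next action for a zero-padded action slot.
        action = policy(np.array(states),
                        np.array(actions + [np.zeros(act_dim)]),
                        np.array(rtgs),
                        np.arange(len(states)))
        state, reward, done = env_step(action)
        actions.append(action)
        rewards.append(reward)
        states.append(state)
        # Condition the next prediction on the remaining (scaled) return.
        rtgs.append(rtgs[-1] - reward / scale)
        if done:
            break
    return sum(rewards)
```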

Step 8: Publish our trained model on the Hub 🔥

Now that we've seen that training gives good results, we can publish our trained model on the Hub 🤗 with one line of code.

Under the hood, the Hub uses git-based repositories (don't worry if you don't know what git is), which means you can update the model with new versions as you experiment and improve your agent.

To be able to share your model with the community, there are three more steps to follow:

1๏ธโƒฃ (If it's not already done) create an account to HF โžก https://huggingface.co/join

2๏ธโƒฃ Sign in and then, you need to store your authentication token from the Hugging Face website.


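In a notebook, the standard flow is `notebook_login` from `huggingface_hub`; this is a sketch of that flow, not necessarily the original cell:

```python
from huggingface_hub import notebook_login

# Running this inside Colab / Jupyter opens a prompt where you paste the
# access token from https://huggingface.co/settings/tokens
# notebook_login()  # uncomment and run inside the notebook
```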

If you don't want to use Google Colab or a Jupyter notebook, use this command instead: huggingface-cli login

3๏ธโƒฃ We're now ready to push our trained model to the ๐Ÿค— Hub ๐Ÿ”ฅ !!

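The push itself can be a single `push_to_hub` call on the trained model, sketched here as a helper so the repo id stays explicit (the repo id below is a placeholder, not a real repository):

```python
def publish(model, repo_id="your-username/decision-transformer-halfcheetah"):
    # Uploads the model weights and config to the Hub under `repo_id`
    # (requires being logged in; see the previous step).
    model.push_to_hub(repo_id)
```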

Some additional challenges 🏆

Congratulations, you've just trained your first Decision Transformer 🥳.

Now, the best way to learn is to try things on your own! Why not try it with another environment?

We provide datasets for some other environments:

Have fun!

Keep learning, stay awesome 🤗