pip install torch numpy transformers datasets tiktoken wandb tqdm
If you are not a deep learning professional and you just want to feel the magic and get your feet wet, the fastest way to get started is to train a character-level GPT on one of the bundled Chinese corpora: modern Chinese poetry or Chinese legal texts. First, the prepare scripts turn the raw text into one large stream of integers:
python data/chinese_modern_poetry/prepare.py
python data/chinese_laws_pretrain/prepare.py

Each prepare script creates a train.bin and val.bin in its own data directory.
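If you're curious what these prepare scripts do, a minimal character-level version in the nanoGPT style looks roughly like the sketch below. The filename input.txt, the 90/10 split, and the meta.pkl vocab file are conventions borrowed from upstream nanoGPT and are assumptions here; the actual scripts in this repo may tokenize differently:

```python
# Hypothetical sketch of a character-level prepare.py in the nanoGPT style;
# the actual scripts under data/ may tokenize differently.
import os
import pickle
import numpy as np

base = os.path.dirname(__file__)
# 'input.txt' is an assumed filename for the raw corpus
with open(os.path.join(base, 'input.txt'), 'r', encoding='utf-8') as f:
    data = f.read()

# Build a character-level vocabulary over the raw text.
# uint16 below assumes vocab_size < 65536, which holds for
# typical Chinese character inventories.
chars = sorted(set(data))
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}

# 90/10 train/val split, encoded as flat uint16 token streams.
n = len(data)
train_ids = np.array([stoi[c] for c in data[:int(0.9 * n)]], dtype=np.uint16)
val_ids = np.array([stoi[c] for c in data[int(0.9 * n):]], dtype=np.uint16)
train_ids.tofile(os.path.join(base, 'train.bin'))
val_ids.tofile(os.path.join(base, 'val.bin'))

# Save the vocab so sampling can decode generated ids back to characters.
with open(os.path.join(base, 'meta.pkl'), 'wb') as f:
    pickle.dump({'vocab_size': len(chars), 'stoi': stoi, 'itos': itos}, f)
```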
Now it is time to train your GPT. Its size very much depends on the computational resources of your system. I have a GPU. Great, we can quickly train a baby GPT with the settings provided in the config files (the _debug variants are presumably shorter runs for sanity-checking):
python train.py config/train_gpt2_chinese_poetry_debug.py
python train.py config/train_gpt2_chinese_poetry.py
python train.py config/train_gpt2_chinese_laws_debug.py
python train.py config/train_gpt2_chinese_laws.py

If you peek inside one of these config files, you'll see that we're training a GPT with a context size of up to 256 characters and 384 feature channels: a 6-layer Transformer with 6 heads in each layer. On one A100 GPU a training run at this scale takes only a few minutes, and the model checkpoints are written into the --out_dir directory named in the config (out-chinese-poetry or out-chinese-laws).
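For reference, a nanoGPT-style config file is just a plain Python module of overrides. A hypothetical small config might look like this; every value below is an assumption, so check the actual config/train_gpt2_chinese_*.py files for the real numbers:

```python
# Hypothetical values for a small character-level config; the numbers in
# config/train_gpt2_chinese_laws.py etc. may well differ -- check the file.
out_dir = 'out-chinese-laws'       # where checkpoints (ckpt.pt) are written
eval_interval = 250                # evaluate on val.bin every 250 iterations
eval_iters = 200
always_save_checkpoint = False     # only checkpoint when val loss improves

dataset = 'chinese_laws_pretrain'  # reads data/chinese_laws_pretrain/{train,val}.bin
batch_size = 64
block_size = 256                   # context of up to 256 characters

n_layer = 6                        # a 6-layer Transformer...
n_head = 6                         # ...with 6 heads per layer...
n_embd = 384                       # ...and 384 feature channels
dropout = 0.2

learning_rate = 1e-3
max_iters = 5000
lr_decay_iters = 5000
min_lr = 1e-4
```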
So once the training finishes we can sample from the best model by pointing the sampling script at this directory:

python sample-chinese-poetry.py --out_dir=out-chinese-poetry
python sample-chinese-laws.py --out_dir=out-chinese-laws

This generates a few samples.
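Under the hood, sampling is just a matter of loading the best checkpoint from --out_dir and decoding tokens. A minimal sketch, assuming nanoGPT's checkpoint layout and the meta.pkl written by the prepare step (the real sample-chinese-*.py scripts may add CLI flags, seeding, and device handling):

```python
# Hypothetical sketch of what a sample-chinese-*.py script does, assuming
# nanoGPT's checkpoint format: ckpt.pt holding 'model_args' and 'model'.
import pickle
import torch
from model import GPT, GPTConfig  # nanoGPT's model definition

out_dir = 'out-chinese-laws'
ckpt = torch.load(f'{out_dir}/ckpt.pt', map_location='cpu')
model = GPT(GPTConfig(**ckpt['model_args']))
# Note: checkpoints saved from a torch.compile'd model may carry an
# '_orig_mod.' prefix on state-dict keys that has to be stripped first.
model.load_state_dict(ckpt['model'])
model.eval()

# Decode generated ids back to text with the saved character vocab.
with open('data/chinese_laws_pretrain/meta.pkl', 'rb') as f:
    itos = pickle.load(f)['itos']

# Start from a single token and sample 200 new characters.
x = torch.zeros((1, 1), dtype=torch.long)
with torch.no_grad():
    y = model.generate(x, max_new_tokens=200, temperature=0.8, top_k=200)
print(''.join(itos[int(i)] for i in y[0]))
```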
