Burkov Andriy - The Hundred-Page Language Models Book + Code - 2025
665242beee7e4774b488c7c3bc79437ddf238317
Category: doc
Total Size: 25.43 MB
Piece Length: 32.00 KB
File Count: 42
First Seen: Mar 25, 2026, 10:28 PM
Last Seen: Mar 25, 2026, 11:33 PM
Hot Score: 5.00
Files
- Burkov Andriy - The Hundred-Page Language Models Book - 2025.pdf24.34 MB
- [DIR] Code-
- Code/README.md130 B
- Code/byte_pair_encoding.ipynb23.86 KB
- Code/count_language_model.ipynb20.87 KB
- Code/embedding_vs_linear.py1.82 KB
- Code/emotion_GPT2_as_classifier.ipynb24.52 KB
- Code/emotion_GPT2_as_text_generator.ipynb115.42 KB
- Code/emotion_GPT2_as_text_generator_LoRA.ipynb29.79 KB
- Code/emotion_classifier_LR.ipynb8.87 KB
- Code/instruct_GPT2.ipynb23.11 KB
- Code/news_RNN_language_model.ipynb397.73 KB
- Code/news_decoder_language_model.ipynb425.15 KB
- Code/quadratic_loss.py1.29 KB
- Code/sampling_method.ipynb18.00 KB
- [DIR] Code/wiki-
- Code/wiki/GPU-rental.md325 B
- Code/wiki/MoE.md409 B
- Code/wiki/PyTorch.md388 B
- Code/wiki/VLM.md611 B
- Code/wiki/alignment.md636 B
- Code/wiki/colabs.md1.12 KB
- Code/wiki/compression.md1.16 KB
- Code/wiki/deployment.md681 B
- Code/wiki/distributed.md331 B
- Code/wiki/embeddings.md991 B
- Code/wiki/encoder-decoder.md1.09 KB
- Code/wiki/encoder.md707 B
- Code/wiki/evaluation.md1.97 KB
- Code/wiki/function-calling.md669 B
- Code/wiki/index.md1.50 KB
- Code/wiki/inference.md3.08 KB
- Code/wiki/math.md1.27 KB
- Code/wiki/merging.md323 B
- Code/wiki/non-transformer.md1.18 KB
- Code/wiki/notebook-services.md337 B
- Code/wiki/online-finetuning.md151 B
- Code/wiki/overfitting.md553 B
- Code/wiki/prompting.md883 B
- Code/wiki/scaling.md372 B
- Code/wiki/scripts.md115 B
- Code/wiki/security.md601 B
- Code/wiki/test.md271 B
- Code/wiki/tokenization.md352 B