LLM Playground

An interactive lab for understanding how large language models work—from tokenization through next-token prediction and text generation—without leaning on high-level frameworks for the core mechanics.

Intent

This repository serves two goals, in order of priority.

1. Primary: Learn how LLMs work under the hood

The main deliverable is llm_playground.ipynb, a hands-on notebook where you implement and experiment with the foundations of language modeling:

Tokenization — word-, character-, and subword-level (BPE); comparison with production tokenizers (tiktoken)
Language modeling — load GPT-2, inspect architecture, count parameters, and interpret logits and next-token probabilities
Decoding — greedy decoding and top-p (nucleus) sampling via Hugging Face generate()
Old vs. modern models — contrast GPT-2 (text completion) with instruction-tuned Qwen3-0.6B (chat templates, reasoning-style behavior)
Looking ahead — how inference engines (Ollama, vLLM, SGLang) fit into real serving

The emphasis is on building a mental model: what happens from raw text to generated output, and why each step exists. Later coursework introduces tools like Ollama and LangChain that abstract much of this; this project is the groundwork.

2. Secondary: Showcase that learning on a portfolio

A second goal is to communicate what you learned on a personal portfolio site: short write-ups, diagrams, and highly visual demos (e.g. token breakdowns, side-by-side GPT-2 vs. Qwen outputs, decoding comparisons) that let visitors explore ideas without reading the full notebook.

The notebook remains the source of truth for depth; the portfolio is the curated, visual layer—screenshots, recordings, embedded demos (e.g. Colab, Hugging Face Spaces), and explanations aimed at recruiters and peers.

Detailed specs for each planned demo live in portfolio/demos/. Those documents are the blueprint for what to build on the portfolio site (Gradio embeds, static sections, video, and copy). See the portfolio hub for site layout, tiers, and implementation phases.

What’s in this repo

Path	Purpose
`llm_playground.ipynb`	Main lab notebook (fill-in exercises + optional interactive playground)
`portfolio/README.md`	Portfolio hub: site structure, demo index, implementation roadmap
`portfolio/demos/*.md`	Per-demo specs (UX, copy, technical notes, build checklists)
`environment.yaml`	Conda environment (`llm_playground`)
`requirements.txt`	Python dependencies for local / uv setup
`env.sh`	Project-local `uv` and venv paths (source from repo root)
`demos-1/moonboots/`	Hugging Face Gradio app (Tokens, NextToken, Decoding, Playground tabs); deploy to moonbootspleb/moonboots
`demos-1/README.md`	Space folder index and HF deploy notes

Stack: Python 3.11, PyTorch, Hugging Face Transformers, tiktoken, JupyterLab, optional ipywidgets for the playground UI.

Models used in the notebook: GPT-2 (small), Qwen3-0.6B (downloaded from Hugging Face on first run).

Getting started

Option A: Google Colab (no local install)

Open Google Colab.
Upload llm_playground.ipynb or open it from GitHub.
Install dependencies in a cell if needed (see the notebook’s environment check).

Option B: Local environment

Conda:

conda env create -f environment.yaml && conda activate llm_playground

uv (faster):

curl -LsSf https://astral.sh/uv/install.sh | sh   # skip if already installed
uv venv .venv --python 3.11 && source .venv/bin/activate
uv pip install -r requirements.txt

Optional — register the Jupyter kernel:

python -m ipykernel install --user --name=llm_playground --display-name "llm_playground"

Open the notebook in JupyterLab, select the llm_playground kernel, and work through the cells in order.

Project-local tooling: from the repo root, source env.sh puts the project’s .venv and uv cache on your PATH.

Learning objectives (summary)

After completing the notebook, you should be able to:

Explain how text becomes token IDs and why subword/BPE tokenization is the default for modern LLMs
Load a pretrained causal LM and describe its main components at a high level
Relate logits and softmax probabilities to “what token comes next”
Compare greedy decoding vs. top-p sampling in behavior and tradeoffs
Articulate the difference between completion-style models (GPT-2) and instruction-tuned chat models (Qwen3)
Describe why production systems use dedicated inference servers instead of loading models inside every app process

Portfolio demos

These demos will be hosted on a personal portfolio site—not only documented in this repo. Each demo has a detailed spec under portfolio/demos/. Interactive demos are intended as Gradio apps on Hugging Face Spaces (embedded via iframe) or linked from the site; static demos are diagrams, galleries, or scroll narratives.

Recommended order on the portfolio page: hero summary → Tier 1 interactives → GPT-2 vs Qwen write-up → Tier 2 supporting content → links to full notebook (Colab / GitHub).

Demo	Tier	Type	Notebook	Spec	Status
Tokenizer explorer	1	Interactive (Space Tokens)	§1.3–1.4	spec	done
Next-token microscope	1	Interactive (Space NextToken)	§2.3	spec	done
Greedy vs top-p	1	Interactive (Space Decoding)	§3.1–3.2	spec	done
GPT-2 vs Qwen	1	Write-up + static (blog Part 2)	§4.1–4.2	spec	done
Tokenization ladder	2	React (blog Part 3)	§1.1–1.3	spec	done
Parameter scale calculator	2	React (blog Part 3)	§2.2	spec	done
Transformer block diagram	2	Static SVG (blog Part 3)	§2.1	spec	done
Decoding playground	2	Interactive (Space Playground)	§5	spec	done
Pipeline story	3	Static scroll (blog Part 4)	Full arc	spec	done
Demo walkthrough video	3	Video placeholder (blog Part 4)	§1–4	spec	placeholder
Failure modes gallery	3	Static gallery (blog Part 4)	§1.1, §3.1, §4.2	spec	done

For the full index, blog mapping, and build phases, see portfolio/README.md.

License

Add a license here if you plan to publish the repo publicly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Playground

Intent

1. Primary: Learn how LLMs work under the hood

2. Secondary: Showcase that learning on a portfolio

What’s in this repo

Getting started

Option A: Google Colab (no local install)

Option B: Local environment

Learning objectives (summary)

Portfolio demos

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
demos-1		demos-1
portfolio		portfolio
.gitignore		.gitignore
README.md		README.md
env.sh		env.sh
environment.yaml		environment.yaml
llm_playground.ipynb		llm_playground.ipynb
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

LLM Playground

Intent

1. Primary: Learn how LLMs work under the hood

2. Secondary: Showcase that learning on a portfolio

What’s in this repo

Getting started

Option A: Google Colab (no local install)

Option B: Local environment

Learning objectives (summary)

Portfolio demos

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages