
fix output softmax visualization #22

Open. jeremyschlatter wants to merge 1 commit into bbycroft:main from jeremyschlatter:softmax



@jeremyschlatter


For the "SM Agg" matrix at the bottom of nano-gpt, the labels were flipped from the values. That is, the row containing the max values was labeled exp, and the row containing the exp values was labeled max.

The cause was that in this odin model, those rows were flipped compared to their counterparts in src/llm/GptModelLayout.ts and src/llm/GptModel.ts. Flipping them back fixes the problem.
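As an illustration of how this kind of mismatch can be avoided, here is a minimal TypeScript sketch that routes every read and write of the aggregate rows through shared index constants, so the labels and the values cannot drift apart. The names `ROW_MAX`, `ROW_EXP`, `ROW_LABELS`, `softmaxAgg`, and `softmax` are hypothetical and are not taken from this repository. The row order shown is an assumption, and so is the reading of the exp row as the sum of exp(x - max).

```typescript
// Hypothetical row indices for a two-row softmax aggregate buffer.
// The real order lives in src/llm/GptModelLayout.ts and src/llm/GptModel.ts;
// the order used here is an assumption for illustration only.
const ROW_MAX = 0;
const ROW_EXP = 1;

// Labels listed in the same order as the indices above.
const ROW_LABELS = ["max", "exp"] as const;

// Computes the aggregates for one vector of logits:
// the maximum, and the sum of exp(x - max) (assumed meaning of the exp row).
function softmaxAgg(logits: number[]): [number, number] {
  const max = Math.max(...logits);
  let sumExp = 0;
  for (const x of logits) {
    sumExp += Math.exp(x - max);
  }
  const agg: [number, number] = [0, 0];
  agg[ROW_MAX] = max;
  agg[ROW_EXP] = sumExp;
  return agg;
}

// Applies softmax using the precomputed aggregates, reading each row
// through the same constants that were used to write it.
function softmax(logits: number[], agg: [number, number]): number[] {
  const max = agg[ROW_MAX];
  const sumExp = agg[ROW_EXP];
  return logits.map((x) => Math.exp(x - max) / sumExp);
}
```

With this arrangement, swapping the two rows means changing the constants and the label list in one place, rather than keeping two separate implementations in agreement by hand.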

Before:

[Screenshots: 2025-10-23 at 3:51:45 PM and 3:55:53 PM]

After:

[Screenshots: 2025-10-23 at 3:56:10 PM and 3:56:15 PM]

This commit also updates the path to native.wasm in build.sh. (build.sh got moved down a directory in 0ba3a39 "Move llm code into its own folder", but this relative path wasn't updated.)

It also deletes native.wasm.o. It's not clear to me that this was intentionally committed in the first place. build.sh deleted it automatically.

@vercel

vercel Bot commented Oct 24, 2025


@jeremyschlatter is attempting to deploy a commit to the Brendan's Team Team on Vercel.

A member of the Team first needs to authorize it.
