LLMs are next-word predictors
From 3Blue1Brown. Full video: https://youtu.be/wjZofJX0v4M
You know, I get through about half the video before it goes all “rest of the f***ing owl” on me, but it has amazing insights to the world of math and geometry.
From 3Blue1Brown. Full video: https://youtu.be/wjZofJX0v4M
From 3Blue1Brown. A behind-the-scenes look at how I animate videos. Code for all the videos: https://github.com/3b1b/videos Manim: https://github.com/3b1b/manim Community edition: https://github.com/ManimCommunity/manim/ I added some more details about the workflow shown in this video to the readme of the videos repo: https://github.com/3b1b/videos?tab=readme-ov-file#workflow These lessons are funded directly by viewers: https://3b1b.co/support Timestamp: 0:00 – Intro 2:39 –…
From 3Blue1Brown. Full video: https://youtu.be/EmKQsSDlaa4
From 3Blue1Brown. Full video: https://youtu.be/EmKQsSDlaa4
From 3Blue1Brown. 3d scenes on 2d film, and a diffraction lesson along the way. Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support An equally valuable form of support is to share the videos. Gabor’s Nobel Prize lecture: https://www.nobelprize.org/uploads/2018/06/gabor-lecture.pdf A few resources we found helpful for this video Seeing the Light,…
From 3Blue1Brown. Why max(rand(), rand()) is the same as sqrt(rand()) See Matt Parker’s video for more: https://youtu.be/ga9Qk38FaHM
From 3Blue1Brown. How do think about max(rand(), rand()) The next short finishes the explanation: https://youtube.com/shorts/lpzUZDefha0
From 3Blue1Brown. max(rand(), rand()) has the same effect as sqrt(rand()) The next short explains why: https://youtube.com/shorts/sNWDjbaT208 See Matt Parker’s video for more: https://youtu.be/ga9Qk38FaHM
From 3Blue1Brown. Unpacking the multilayer perceptrons in a transformer, and how they may store facts Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support An equally valuable form of support is to share the videos. AI Alignment forum post from the Deepmind researchers referenced at the video’s start: https://www.alignmentforum.org/posts/iGuwZTHWb6DFY3sKB/fact-finding-attempting-to-reverse-engineer-factual-recall Anthropic posts…
From 3Blue1Brown. A link to the full video is on the screen, or here for reference: https://youtu.be/W3I3kAg2J7w
From 3Blue1Brown. I had the pleasure of being invited to give Harvey Mudd’s commencement speech this year. Reposted here with permission from the University Timestamps: 0:00 – End of Harriet Nembhard’s introduction 0:45 – The cliché 2:28 – The shifting goal 5:57 – Action precedes motivation 7:02 – Timing 10:47 – Know your influence 12:05…
From 3Blue1Brown. This comes from a full video breaking down how LLMs work. The link is on the bottom of the screen (in the shorts feed at least), or here for reference: https://youtu.be/wjZofJX0v4M
From 3Blue1Brown. This comes from a full video dissecting how LLMs work. In the shorts player, you can click the link at the bottom of the screen, or for reference: https://youtu.be/wjZofJX0v4M
From 3Blue1Brown. Demystifying attention, the key mechanism inside transformers and LLMs. Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support Special thanks to these supporters: https://www.3blue1brown.com/lessons/attention#thanks An equally valuable form of support is to simply share the videos. Demystifying self-attention, multiple heads, and cross-attention. Instead of sponsored ad reads, these lessons…
From 3Blue1Brown. Breaking down how Large Language Models work Instead of sponsored ad reads, these lessons are funded directly by viewers: https://3b1b.co/support — Here are a few other relevant resources Build a GPT from scratch, by Andrej Karpathy If you want a conceptual understanding of language models from the ground up, @vcubingx just started a…
From 3Blue1Brown. A link to the full video is at the bottom of the screen. Or, for reference: https://youtu.be/aXRTczANuIs
From 3Blue1Brown. A link to the full video answering this is at the bottom of the screen. Or, for reference: https://youtu.be/LqbZpur38nw Thanks to these viewers for their contributions to translations Bulgarian: Martin Grozdanov French: GiveMeChocolate, Yoyodotpy German: Josh, dlatikay Hebrew: Omer Tuchfeld Hindi: rajeshwar-pandey Spanish: Marcelo Lynch
From 3Blue1Brown. A link to the full video answering this is at the bottom of the screen. Or, for reference: https://youtu.be/bOXCLR3Wric Thanks to these viewers for their contributions to translations French: GiveMeChocolate Hindi: rajeshwar-pandey Spanish: Yago Iglesias
From 3Blue1Brown. A link to the full video is at the bottom of the screen. Or, for reference: https://youtu.be/pQa_tWZmlGs The full video this comes from proves why slicing a cone gives the same shape as the two-thumbtacks-and-string construction, which is beautiful. Editing from long-form to short by Dawid Kołodziej
From 3Blue1Brown. A link to the full video is at the bottom of the screen. Or, for reference: https://youtu.be/HZGCoVF3YvM Editing from long-form to short by Dawid Kołodziej