Coding puzzles, practices, and education; structured prediction; the culture of Hugging Face; large models; and the energy of New York.
Transcript: https://web.stanford.edu/class/cs224u/podcast/rush/
Sasha's website
Sasha on Twitter
Sasha on the Humans of AI podcast
Sasha on The Thesis Review Podcast with Sean Welleck
Sasha on the Talking Machines Podcast
Sasha interviewed by Sayak Paul
Hugging Face
PyTorch
The Annotated Transformer
The Annotated Alice
The Annotated S4
Sasha and Dan Oneață's declarative graphics library Chalk
Drawing Big Ben in Chalk
OpenNMT
Ken Shan
Blog post by Ken and Dylan Thurston
Edward Z. Yang
Stuart Shieber
Literate programming
Soumith Chintala
Lua Torch
TensorFlow
Graham Neubig
Chris Dyer
DyNet
JAX
jax.vmap
Matt Johnson
Finale Doshi-Velez, whose undergrad ML course inspired and informed Sasha's
Tensor Puzzles
GPU Puzzles
A tweet that Chris added to his CV
Adam Paszke
Dougal MacLaurin
Dex
Named Tensor notation
Named Tensors in PyTorch
TorchDim
Mini Torch
Torch-Struct
Sarah Hooker's paper 'The hardware lottery'
Jacob Andreas
Kevin Ellis
Hugging Face transformers library
Hugging Face datasets library
Hugging Face diffusers library
Hugging Face evaluate library
scikit-learn
Big Science blog
BLOOM
The Technology Behind BLOOM Training
CRFM
Eleuther
T0 and PromptSource
Washington Post: Big Tech builds AI with bad data. So scientists sought better data
The bet: Is Attention All You Need?
Democratizing access to large-scale language models with OPT-175B
Epic OPT-175 Logbook
Google's PaLM
United's shares plunge 76% on bogus bankruptcy report
Imagen
Albert Gu
Bell Labs