Duarte O.Carmo's articles

#57 Faísca: The modern LLM stack in a single script


Listen Later

  • Why do this?
  • A small dataset of news headlines
  • GPT2 in PyTorch
  • Pre-training headlines in Portuguese
  • Supervised fine-tuning (SFT) on Portuguese from Portugal
  • Reinforcement Learning (GRPO) for sports news
  • Final thoughts & Acknowledgements
  • Why do this?

    ML and AI are moving at an incredible pace. The amount of research coming out …

    ...more
    View all episodesView all episodes
    Download on the App Store

    Duarte O.Carmo's articlesBy Duarte O.Carmo