AI-101

AI Research Papers

Plain-language TL;DR summaries of the most influential AI research papers. Each entry explains what the paper does and why it matters, and links to the full text.

  1. Gemini 3 and Gemini 2.5 - Google DeepMind (2025-2026)
    Read summary
  2. GPT-5 System Card - OpenAI (August 2025)
    Read summary
  3. Claude 4 System Card - Anthropic (May 2025)
    Read summary
  4. DeepSeek-R1: Incentivizing Reasoning via Reinforcement Learning (January 2025)
    Read summary
  5. LLaMA 4: Natively Multimodal Open-Source AI - Meta (April 2025)
    Read summary
  6. Circuit Tracing: Mechanistic Interpretability Breakthrough - Anthropic (2025)
    Read summary
  7. Meta Chain-of-Thought: System 2 Reasoning in LLMs (January 2025)
    Read summary
  8. Alignment Faking in Large Language Models - Anthropic (December 2024)
    Read summary
  9. From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review (2025)
    Read summary
  10. AlphaGenome and AI for Science - Google DeepMind (2025-2026)
    Read summary
  11. Gemini: A Family of Highly Capable Multimodal Models (2023)
    Read summary
  12. The Claude 3 Model Family (2024)
    Read summary
  13. Direct Preference Optimization: Your Language Model Is Secretly a Reward Model (2023)
    Read summary
  14. Tree of Thoughts: Deliberate Problem Solving with Large Language Models (2023)
    Read summary
  15. Switch Transformers: Scaling to Trillion Parameter Models (2022)
    Read summary
  16. Mistral 7B (2023)
    Read summary
  17. LoRA: Low-Rank Adaptation of Large Language Models (2021)
    Read summary
  18. High-Resolution Image Synthesis with Latent Diffusion Models (2022)
    Read summary
  19. Denoising Diffusion Probabilistic Models (2020)
    Read summary
  20. DALL-E: Zero-Shot Text-to-Image Generation (2021)
    Read summary
  21. Scaling Laws for Neural Language Models (2020)
    Read summary
  22. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (2022)
    Read summary
  23. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (2020)
    Read summary
  24. GPT-4 Technical Report (2023)
    Read summary
  25. LLaMA: Open and Efficient Foundation Language Models (2023)
    Read summary
  26. Constitutional AI: Harmlessness from AI Feedback (2022)
    Read summary
  27. Training Language Models to Follow Instructions with Human Feedback (2022)
    Read summary
  28. Language Models Are Few-Shot Learners - GPT-3 (2020)
    Read summary
  29. BERT: Pre-training of Deep Bidirectional Transformers (2018)
    Read summary
  30. Attention Is All You Need (2017)
    Read summary