Open in app

Sign in

Medium Logo
Write

Sign in

Ali Issa
Ali Issa

78 followers

Home

Lists

About

Building and Training Large Language Models (LLMs): A Stanford Lecture Summary

This is a summary of the Stanford CS229 Machine Learning lecture : Building Large Language Models (LLMs) by Yann Dubois

Apr 1
Building and Training Large Language Models (LLMs): A Stanford Lecture Summary
Building and Training Large Language Models (LLMs): A Stanford Lecture Summary
Apr 1

Understanding Reasoning in LLMs

These notes are based on Sebastian’s article about Understanding Reasoning LLMs from his newsletter. It offers valuable insights explained…

Feb 25
Understanding Reasoning in LLMs
Understanding Reasoning in LLMs
Feb 25

Key Takeaways from the MLOps Data Lifecycle in Production Course

As I revisit the Machine Learning Data Lifecycle in Production course (a refresher before diving into the third course), I’d like to share…

Jan 15
Key Takeaways from the MLOps Data Lifecycle in Production Course
Key Takeaways from the MLOps Data Lifecycle in Production Course
Jan 15

From Raw Data to Model Efficiency: Mastering Feature Engineering and Selection

In the world of machine learning, your model is only as good as the data it learns from. Transforming raw data into a structured…

Dec 6, 2024
From Raw Data to Model Efficiency: Mastering Feature Engineering and Selection
From Raw Data to Model Efficiency: Mastering Feature Engineering and Selection
Dec 6, 2024

Enhancing Large Language Models: The Power of Continued Pre-Training

𝐖𝐡𝐚𝐭 𝐢𝐬 𝐜𝐨𝐧𝐭𝐢𝐧𝐮𝐞𝐝 𝐩𝐫𝐞-𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠

Nov 17, 2024
1
Enhancing Large Language Models: The Power of Continued Pre-Training
Enhancing Large Language Models: The Power of Continued Pre-Training
Nov 17, 2024
1

Reproducing GPT-2 124M: Key Insights from Andrej Karpathy’s 4-Hour Deep Dive

This is a summary of Andrej karpathy’s video about pre-training a GPT-2 124M parameter model from scratch.Feel free to check it using this…

Sep 10, 2024
Reproducing GPT-2 124M: Key Insights from Andrej Karpathy’s 4-Hour Deep Dive
Reproducing GPT-2 124M: Key Insights from Andrej Karpathy’s 4-Hour Deep Dive
Sep 10, 2024

Multi-Agent System with Crew AI: A Short Course Summary

Please note that this article contains direct quotes content from the short course on multi-agent systems with Crew AI.

Jul 4, 2024
Multi-Agent System with Crew AI: A Short Course Summary
Multi-Agent System with Crew AI: A Short Course Summary
Jul 4, 2024

Stanford’s CS25: Lecture 2summary

Lecture 2 of Stanford University’s CS25 V4 Transformers course was delivered by Jason Wei and Hyung Won Chung. Highly recommended to watch…

Jul 2, 2024
Stanford’s CS25: Lecture 2summary
Stanford’s CS25: Lecture 2summary
Jul 2, 2024

Stanford’s CS25: Lecture 1 summary

I recently watched the first lecture of Stanford University CS25 V4 Transformers course, and presented by Div Garg, Steven Feng, Emily…

Jun 11, 2024
Stanford’s CS25: Lecture 1 summary
Stanford’s CS25: Lecture 1 summary
Jun 11, 2024

𝐕𝐢𝐬𝐮𝐚𝐥𝐢𝐳𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐓𝐡𝐨𝐮𝐠𝐡𝐭 𝐄𝐥𝐢𝐜𝐢𝐭𝐬 𝐒𝐩𝐚𝐭𝐢𝐚𝐥 𝐑𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠…

Imagine a scenario where you’re listening to a story. As you follow along, your mind naturally starts to visualize the the unseen objects…

May 24, 2024
𝐕𝐢𝐬𝐮𝐚𝐥𝐢𝐳𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐓𝐡𝐨𝐮𝐠𝐡𝐭 𝐄𝐥𝐢𝐜𝐢𝐭𝐬 𝐒𝐩𝐚𝐭𝐢𝐚𝐥 𝐑𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠…
𝐕𝐢𝐬𝐮𝐚𝐥𝐢𝐳𝐚𝐭𝐢𝐨𝐧 𝐨𝐟 𝐓𝐡𝐨𝐮𝐠𝐡𝐭 𝐄𝐥𝐢𝐜𝐢𝐭𝐬 𝐒𝐩𝐚𝐭𝐢𝐚𝐥 𝐑𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠…
May 24, 2024
Ali Issa

Ali Issa

78 followers
Following
  • Data Science Collective

    Data Science Collective

  • Taghridtaleb

    Taghridtaleb

  • Jana Kabrit

    Jana Kabrit

  • Andrej Karpathy

    Andrej Karpathy

  • Benjamin Marie

    Benjamin Marie

See all (105)

Help

Status

About

Careers

Press

Blog

Privacy

Rules

Terms

Text to speech