Enhancing Large Language Models: The Power of Continued Pre-Training๐๐ก๐๐ญ ๐ข๐ฌ ๐๐จ๐ง๐ญ๐ข๐ง๐ฎ๐๐ ๐ฉ๐ซ๐-๐ญ๐ซ๐๐ข๐ง๐ข๐ง๐ 3d ago3d ago
Reproducing GPT-2 124M: Key Insights from Andrej Karpathyโs 4-Hour Deep DiveThis is a summary of Andrej karpathyโs video about pre-training a GPT-2 124M parameter model from scratch.Feel free to check it using thisโฆSep 10Sep 10
Multi-Agent System with Crew AI: A Short Course SummaryPlease note that this article contains direct quotes content from the short course on multi-agent systems with Crew AI.Jul 4Jul 4
Stanfordโs CS25: Lecture 2summaryLecture 2 of Stanford Universityโs CS25 V4 Transformers course was delivered by Jason Wei and Hyung Won Chung. Highly recommended to watchโฆJul 2Jul 2
Stanfordโs CS25: Lecture 1 summaryI recently watched the first lecture of Stanford University CS25 V4 Transformers course, and presented by Div Garg, Steven Feng, EmilyโฆJun 11Jun 11
๐๐ข๐ฌ๐ฎ๐๐ฅ๐ข๐ณ๐๐ญ๐ข๐จ๐ง ๐จ๐ ๐๐ก๐จ๐ฎ๐ ๐ก๐ญ ๐๐ฅ๐ข๐๐ข๐ญ๐ฌ ๐๐ฉ๐๐ญ๐ข๐๐ฅ ๐๐๐๐ฌ๐จ๐ง๐ข๐ง๐ โฆImagine a scenario where youโre listening to a story. As you follow along, your mind naturally starts to visualize the the unseen objectsโฆMay 24May 24
LLMOps IntroductionI recently completed a short course called LLMOps by DeepLearning.AI, in collaboration with Google Cloud, instructed by Erwin HuizengaMay 12May 12
How to Optimize LLMs for Efficient ServingWe all know that serving LLMs in production is complicated. It requires extensive research and the design of the best architecture andโฆMay 8May 8
Mastering LLM: A Comprehensive Guide to Legal Language Model ServingWe all know that serving LLMs in production is complicated. It requires extensive research and the design of the best architecture andโฆMay 2May 2