# AI Horizons 25-11 – Apple–Google AI Pact
Apple licenses Google's 1.2T-parameter Gemini model to power Siri in a $1B/year deal, a strategic interim step before Apple ships its own model in 2026.
A retrospective on ChatGPT's third anniversary, covering its surprising launch, initial internal skepticism, and unprecedented growth to 800 million users.
Wikipedia's new guideline advises against using LLMs to generate new articles from scratch, highlighting limitations of AI in content creation.
Moonshot AI's Kimi K2 Thinking is a 1-trillion-parameter open-weight model optimized for multi-step reasoning and long-running tool calls.
Explores the common practice of developers assigning personas to Large Language Models (LLMs) to better understand their quirks and behaviors.
The article argues that AI's non-deterministic nature clashes with traditional computer interfaces, creating a fundamental human-AI interaction problem.
A hands-on guide for JavaScript developers to learn Generative AI and LLMs through interactive lessons, projects, and a companion app.
An introduction to reasoning in Large Language Models, covering concepts like chain-of-thought and methods to improve LLM reasoning abilities.
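The core idea behind chain-of-thought prompting is simple enough to show in a few lines. Below is a minimal sketch; `query_llm` is a hypothetical placeholder for whatever completion API is in use, not a function from the article.

```python
# A minimal sketch of chain-of-thought prompting.
# `query_llm` is a hypothetical stand-in for a real LLM client.

def query_llm(prompt: str) -> str:
    return "<model response>"  # placeholder; wire up a real client here

question = "A train travels 60 km in 45 minutes. What is its speed in km/h?"

# Direct prompting: the model must jump straight to the answer.
direct = query_llm(question)

# Chain-of-thought: asking for intermediate steps tends to improve
# accuracy on multi-step arithmetic and logic problems.
cot = query_llm(question + "\nLet's think step by step.")
```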
Explores four main approaches to building and enhancing reasoning capabilities in Large Language Models (LLMs) for complex tasks.
A researcher reflects on 2024 highlights in AI, covering societal impacts, software tools like Scikit-learn, and technical research on tabular data and language models.
Explains how multimodal LLMs work, compares recent models like Llama 3.2, and outlines two main architectural approaches for building them.
Explores whether large language models like ChatGPT truly reason or merely recite memorized text from their training data, examining their logical capabilities.
Explores methods for using and finetuning pretrained large language models, including feature-based approaches and parameter updates.
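As a rough illustration of the feature-based approach described there, the sketch below freezes a pretrained backbone and trains only a small classifier on its extracted features. The tiny `nn.TransformerEncoder` stands in for a real pretrained checkpoint, which is an assumption made for self-containment.

```python
import torch
import torch.nn as nn

# Feature-based finetuning sketch: the pretrained backbone stays frozen
# and only a small classifier head is trained on its output features.
pretrained = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=128, nhead=4, batch_first=True),
    num_layers=2,
)  # stand-in; in practice, load weights from a real checkpoint

for p in pretrained.parameters():
    p.requires_grad = False  # backbone is not updated

classifier = nn.Linear(128, 2)  # only these parameters receive gradients
optimizer = torch.optim.AdamW(classifier.parameters(), lr=1e-3)

x = torch.randn(8, 16, 128)           # (batch, seq_len, d_model) inputs
labels = torch.randint(0, 2, (8,))

features = pretrained(x).mean(dim=1)  # mean-pooled sequence representation
loss = nn.functional.cross_entropy(classifier(features), labels)
loss.backward()
optimizer.step()
```

The parameter-update alternative differs only in leaving `requires_grad` on for the backbone (all layers, or just the top ones) and passing those parameters to the optimizer as well.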
Argues that the term 'Open Source' is misleading for LLMs and proposes the new term 'PALE LLMs' (Publicly Available, Locally Executable).
Explores the balanced use of AI coding tools like GitHub Copilot, discussing benefits, risks of hallucinations, and best practices for developers.
Exploring mixed-precision techniques to speed up large language model training and inference by up to 3x without losing accuracy.
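For readers unfamiliar with the mechanics, here is a minimal mixed-precision training step in PyTorch, assuming a CUDA GPU: the forward pass runs in float16 under autocast, while a gradient scaler keeps small gradients from underflowing.

```python
import torch

# Minimal mixed-precision training step (assumes a CUDA device).
model = torch.nn.Linear(512, 512).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
scaler = torch.cuda.amp.GradScaler()

x = torch.randn(64, 512, device="cuda")
target = torch.randn(64, 512, device="cuda")

optimizer.zero_grad()
with torch.autocast(device_type="cuda", dtype=torch.float16):
    loss = torch.nn.functional.mse_loss(model(x), target)  # fp16 forward

scaler.scale(loss).backward()  # scale the loss, then backprop
scaler.step(optimizer)         # unscale gradients, apply the update
scaler.update()                # adjust the scale factor for the next step
```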
A guide to parameter-efficient finetuning methods for large language models, covering techniques like prefix tuning and LLaMA-Adapters.
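The common thread across these methods is training only a small number of new parameters while the pretrained weights stay frozen. Below is a sketch of a bottleneck adapter, one simple member of this family (prefix tuning and LLaMA-Adapters differ mainly in where the new parameters are injected):

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter: a small trainable module inserted into a
    frozen network, with a residual connection as the default path."""

    def __init__(self, d_model: int, bottleneck: int = 16):
        super().__init__()
        self.down = nn.Linear(d_model, bottleneck)  # project down
        self.up = nn.Linear(bottleneck, d_model)    # project back up

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # Residual path preserves the frozen model's behavior by default.
        return h + self.up(torch.relu(self.down(h)))

adapter = Adapter(d_model=768)
trainable = sum(p.numel() for p in adapter.parameters())
print(f"trainable adapter params: {trainable}")  # ~25k per layer,
# versus the millions of frozen parameters in the host model
```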
Guide to finetuning large language models on a single GPU using gradient accumulation to overcome memory limitations.
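The trick is compact enough to show directly: accumulate gradients over several small micro-batches, then take one optimizer step, simulating a large effective batch on limited GPU memory. A minimal sketch with illustrative sizes:

```python
import torch

model = torch.nn.Linear(512, 10)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
accum_steps = 4  # effective batch = micro-batch size * accum_steps

optimizer.zero_grad()
for step in range(100):
    x = torch.randn(8, 512)            # small micro-batch that fits in memory
    y = torch.randint(0, 10, (8,))
    loss = torch.nn.functional.cross_entropy(model(x), y)
    (loss / accum_steps).backward()    # scale so summed grads match a big-batch mean
    if (step + 1) % accum_steps == 0:
        optimizer.step()               # one update per accum_steps micro-batches
        optimizer.zero_grad()
```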
A curated reading list of key academic papers for understanding the development and architecture of large language models and transformers.
Analyzes the limitations of AI chatbots like ChatGPT in providing accurate technical answers and discusses the need for curated data and human experts.