This website uses cookies
Read our Privacy policy and Terms of use for more information.
How AI models are built, what sets each generation apart, and where their architectures lead – from frontier LLMs to world models, traced in context.
AI 101
+2

13 min read
Jun 4, 2026
NVIDIA Cosmos is a platform of world foundation models for Physical AI: video curation, tokenizer, diffusion and autoregressive WFMs, and guardrails explained. Plus sensational 2026 update – Cosmos 3 omnimodel world model

AI 101
+2

13 min read
Apr 8, 2026
How much intelligence can you extract from the hardware you already have? Gemma 4 has the answer

AI 101
+3

12 min read
Mar 18, 2026
How NVIDIA amplifies the open model space with an outstanding lineup of partners: Black Forest Labs, Cursor, LangChain, Mistral AI, Perplexity, Reflection AI, Sarvam and Thinking Machines


AI 101
+2

15 min read
Jan 21, 2026
What are VLA models? Learn Vision-Language-Action architecture, key systems (π0, Helix, SmolVLA) & the leap to VLA+. Deep dive.


AI 101
+2

11 min read
Nov 19, 2025
LeJEPA by Yann LeCun: provably stable self-supervised learning without heuristics. SIGReg, isotropic Gaussian embeddings & world models explained. Turing Post.


AI 101
+1

9 min read
Nov 12, 2025
Tracing the rise of China’s agentic intelligence strategy – from Kimi’s early vision to today’s open-source breakthrough


AI 101
+2

13 min read
Oct 8, 2025
A glimpse at Code World Model, PSI, and others – redefining how models catch the world in their nets

Concepts
+2

11 min read
Sep 17, 2025
Everything you need to know about models that defend AI today


Concepts
+4

10 min read
Aug 27, 2025
Explore how rethinking world model building patterns can turn our vision upside down and lead to a new Physical, Agentic, and Nested (PAN) system

AI 101
+1

12 min read
Aug 6, 2025
What is GPT-OSS? OpenAI's open-weight MoE models explained: architecture, Ollama setup, memory requirements, and benchmarks vs DeepSeek & Qwen3.



AI 101
+1

4 min read
Jul 23, 2025
Refreshing Smol and Qwen models, Liquid Foundation Models with latest Hyena Edge, and legendary BERT

AI 101
+1

14 min read
Jun 18, 2025
Reasoning models use chain-of-thought to solve complex problems. Compare o1, DeepSeek-R1 & QwQ — and learn when to use each.


AI 101
+3

11 min read
May 28, 2025
BERT explained: how bidirectional pre-training works, MLM vs NSP, fine-tuning, RoBERTa, DistilBERT, ModernBERT, NeoBERT, and ConstBERT for retrieval.

AI 101
+2

11 min read
Apr 30, 2025
we discuss a new wave of architecture from Liquid AI – built from first principles, optimized for real hardware, and challenging the Transformer playbook with smarter, leaner models

AI 101
+3

12 min read
Apr 9, 2025
A deep dive into the history and current advancements in world models and why they are an important puzzle piece for the future of AI

AI 101
+2

9 min read
Mar 19, 2025
we discuss the timeline of Qwen models, focusing on their agentic capabilities and how they compete with other models, and also explore what is Qwen-Agent framework and how you can use it

AI 101
+1

12 min read
Feb 26, 2025
We explore the power of datasets and their integration in Hugging Face's small language models family, particularly in SmolLM2.

AI 101
+1

6 min read
Nov 27, 2024
Let's explore a smarter Vision-Language Model (VLM) that thinks step-by-step

AI 101
+1

6 min read
Nov 6, 2024
we trace Mistral's strategic roadmap and and unpack the unique performance of les Ministraux (Ministral)

AI 101
+1

6 min read
Oct 16, 2024
Explore how OpenAI made their automatic speech recognition (ASR) model multilingual and multitasking

AI 101
+1

5 min read
Sep 25, 2024
OLMoE: open-source sparse Mixture-of-Experts with 1B active and 7B total parameters. How it works, how it was trained, and why it matters for open-source AI.


AI 101
+1

8 min read
Aug 28, 2024
We discuss the innovation suggested by the DeepSeek team, how it improves the models' performance, and dive into the architectures and implementation of the models


Turing Post is an AI newsletter for engineers, researchers, founders, and technical managers who want to understand how machine learning and AI actually work.
Built on more than two decades in tech and seven years focused on AI, we track the research that matters, the systems being built, and the ideas shaping the field, from LLMs and AI agents to JEPA, world models, retrieval, inference, evaluation, AI infrastructure, and agentic workflows.
Join 110,000+ professionals who rely on Turing Post for precise, grounded analysis of AI’s past, present, and future.