Topic

LLM

4 articles

AI Geopolitics Tech Industry Developer Tools Cybersecurity AI Models AI Infrastructure Infrastructure Semiconductors Cloud Infrastructure Web Development Iran

AI Microsoft Developer Tools GitHub Copilot OpenAI Azure LLM

Microsoft Launches 7 In-House AI Models at Build 2026 to Cut OpenAI Reliance

Microsoft unveiled 7 in-house MAI models at Build 2026: MAI-Thinking-1 (35B active params, no OpenAI data) and MAI-Code-1-Flash now live in GitHub Copilot and VS Code.

June 7, 2026·8 min read

AI Google Developer Tools LLM

Gemini 3 Deep Think Hits 84.6% on ARC-AGI-2 and 3455 Elo on Codeforces

Google's Gemini 3 Deep Think scored 84.6% on ARC-AGI-2, 48.4% on Humanity's Last Exam, and 3455 Elo on Codeforces. Gemini 3.1 Pro is now in preview. Here is what the benchmarks actually mean.

March 30, 2026·8 min read

Xiaomi AI Models China LLM

Xiaomi's Hunter Alpha: The 1-Trillion-Parameter Model That Ran Anonymous for 8 Days

On March 11 a mystery 1-trillion-parameter model appeared on OpenRouter. The AI community burned 500 billion tokens assuming it was DeepSeek V4. On March 19 Xiaomi revealed it was theirs.

March 23, 2026·7 min read

AI Developers RAG LLM

RAG in Production 2026: Chunking Strategies, Embedding Costs, and What Actually Works at Scale

Most RAG tutorials show you how to build a demo. This post covers what breaks in production: chunking at 512 tokens beats semantic splitting, embedding costs range from $0.02 to $0.18 per million tokens, re-ranking boosts precision by 18–42%, and agentic RAG is now the 2026 standard. A practical guide for developers shipping RAG to real users.

March 11, 2026·12 min read