Announcing a new, open suite of tools for language model interpretability
Large Language Models (LLMs) are capable of incredible feats of reasoning, yet their internal decision-making processes remain largely opaque.…
# Introduction
OCR (Optical Character Recognition) models are gaining new recognition every day. I am seeing new open-source models pop up on Hugging Face…
Google 2025 recap: Research breakthroughs of the year
# Introduction
We all have those tasks that eat up our time without adding real value. These include sorting downloaded files, renaming photos, backing up…
Thinking Machines Lab has moved its Tinker training API into general availability and added three major capabilities: support for the Kimi K2 Thinking reasoning model, OpenAI-compatible sampling, and image…
Today, we're expanding the Gemini 3 model family with the release of Gemini 3 Flash, which offers frontier intelligence built for speed at a fraction of the cost. With this…
# Introduction
We have all spent hours debugging a model, only to discover that it wasn't the algorithm but a wrong null value manipulating your…
What customers are saying
Google Cloud customers are already using Gemini’s native audio capabilities to drive real business results, from mortgage processing to customer calls. “Users often forget they’re talking…
# Introduction
Are we all in a race to the bottom created by ourselves? Data professionals have been employed for years to develop large language…
Zhipu AI has open-sourced the GLM-4.6V series as a pair of vision language models that treat images, video and tools as first-class inputs for agents, not as afterthoughts…