Normal view

24 July 2025 at 14:37

On Wednesday, the White House released "Winning the Race: America's AI Action Plan," a 25-page document that outlines the Trump administration's strategy to "maintain unquestioned and unchallenged global technological dominance" in AI through deregulation, infrastructure investment, and international partnerships. But critics are already taking aim at the plan, saying it's doing Big Tech a big favor.

Assistant to the President for Science and Technology Michael Kratsios and Special Advisor for AI and Crypto David Sacks crafted the plan, which frames AI development as a race the US must win against global competitors, particularly China.

The document describes AI as the catalyst for "an industrial revolution, an information revolution, and a renaissance—all at once." It calls for removing regulatory barriers that the administration says hamper private sector innovation. The plan explicitly reverses several Biden-era policies, including Executive Order 14110 on AI model safety measures, which President Trump rescinded on his first day in office during his second term.

VentureBeat
Mixture-of-recursions delivers 2x faster inference—Here’s how to implement it 23 July 2025 at 00:05

Mixture-of-recursions delivers 2x faster inference—Here’s how to implement it

VentureBeat

23 July 2025 at 00:05

Mixture-of-Recursions (MoR) is a new AI architecture that promises to cut LLM inference costs and memory use without sacrificing performance.Read More

Ars Technica
ChatGPT’s new AI agent can browse the web and create PowerPoint slideshows 17 July 2025 at 20:41

ChatGPT’s new AI agent can browse the web and create PowerPoint slideshows

Ars Technica

17 July 2025 at 20:41

On Thursday, OpenAI launched ChatGPT Agent, a new feature that lets the company's AI assistant complete multi-step tasks by controlling its own web browser. The update merges capabilities from OpenAI's earlier Operator tool and the Deep Research feature, allowing ChatGPT to navigate websites, run code, and create documents while users maintain control over the process.

The feature marks OpenAI's latest entry into what the tech industry calls "agentic AI"—systems that can take autonomous multi-step actions on behalf of the user. OpenAI says users can ask Agent to handle requests like assembling and purchasing a clothing outfit for a particular occasion, creating PowerPoint slide decks, planning meals, or updating financial spreadsheets with new data.

The system uses a combination of web browsers, terminal access, and API connections to complete these tasks, including "ChatGPT Connectors" that integrate with apps like Gmail and GitHub.

Ars Technica
Google hides secret message in name list of 3,295 AI researchers 17 July 2025 at 17:12

Google hides secret message in name list of 3,295 AI researchers

Ars Technica

17 July 2025 at 17:12

How many Google AI researchers does it take to screw in a lightbulb? A recent research paper detailing the technical core behind Google's Gemini AI assistant may suggest an answer, listing an eye-popping 3,295 authors.

It's a number that recently caught the attention of machine learning researcher David Ha (known as "hardmaru" online), who revealed on X that the first 43 names also contain a hidden message. "There’s a secret code if you observe the authors’ first initials in the order of authorship," Ha wrote, relaying the Easter egg: "GEMINI MODELS CAN THINK AND GET BACK TO YOU IN A FLASH."

The paper, titled "Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities," describes Google's Gemini 2.5 Pro and Gemini 2.5 Flash AI models, which were released in March. These large language models, which power Google's chatbot AI assistant, feature simulated reasoning capabilities that produce a string of "thinking out loud" text before generating responses in an attempt to help them solve more difficult problems. That explains "think" and "flash" in the hidden text.

VentureBeat
Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems 16 July 2025 at 00:28

Google study shows LLMs abandon correct answers under pressure, threatening multi-turn AI systems

VentureBeat

16 July 2025 at 00:28

A DeepMind study finds LLMs are both stubborn and easily swayed. This confidence paradox has key implications for building AI applications.Read More

VentureBeat
OpenAI, Google DeepMind and Anthropic sound alarm: ‘We may be losing the ability to understand AI’ 15 July 2025 at 22:49

OpenAI, Google DeepMind and Anthropic sound alarm: ‘We may be losing the ability to understand AI’

VentureBeat

By:Michael Nuñez

15 July 2025 at 22:49

Scientists unite to warn that a critical window for monitoring AI reasoning may close forever as models learn to hide their thoughts.Read More

VentureBeat
A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models 11 July 2025 at 22:26

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

VentureBeat

11 July 2025 at 22:26

A new AI model learns to "think" longer on hard problems, achieving more robust reasoning and better generalization to novel, unseen tasks.Read More

VentureBeat
New 1.5B router model achieves 93% accuracy without costly retraining 7 July 2025 at 23:25

New 1.5B router model achieves 93% accuracy without costly retraining

VentureBeat

7 July 2025 at 23:25

Katanemo Labs' new LLM routing framework aligns with human preferences and adapts to new models without retraining.Read More

VentureBeat
Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30% 3 July 2025 at 22:00

Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%

VentureBeat

3 July 2025 at 22:00

Sakana AI's new inference-time scaling technique uses Monte-Carlo Tree Search to orchestrate multiple LLMs to collaborate on complex tasks.Read More

VentureBeat
Kumo’s ‘relational foundation model’ predicts the future your LLM can’t see 27 June 2025 at 19:40

Kumo’s ‘relational foundation model’ predicts the future your LLM can’t see

VentureBeat

27 June 2025 at 19:40

Forecasting is a fundamentally new capability that is missing from the current purview of generative AI. Here's how Kumo is changing that.Read More

VentureBeat
Beyond static AI: MIT’s new framework lets models teach themselves 23 June 2025 at 21:58

Beyond static AI: MIT’s new framework lets models teach themselves

VentureBeat

23 June 2025 at 21:58

MIT researchers developed SEAL, a framework that lets language models continuously learn new knowledge and tasks.Read More

VentureBeat
Meta’s new world model lets robots manipulate objects in environments they’ve never encountered before 12 June 2025 at 22:22

Meta’s new world model lets robots manipulate objects in environments they’ve never encountered before

VentureBeat

12 June 2025 at 22:22

A robot powered by V-JEPA 2 can be deployed in a new environment and successfully manipulate objects it has never encountered before.Read More

Ars Technica
New study shows why simulated reasoning AI models don’t yet live up to their billing 25 April 2025 at 21:43

New study shows why simulated reasoning AI models don’t yet live up to their billing

Ars Technica

25 April 2025 at 21:43

There's a curious contradiction at the heart of today's most capable AI models that purport to "reason": They can solve routine math problems with accuracy, yet when faced with formulating deeper mathematical proofs found in competition-level challenges, they often fail.

That's the finding of eye-opening preprint research into simulated reasoning (SR) models, initially listed in March and updated in April, that mostly fell under the news radar. The research serves as an instructive case study on the mathematical limitations of SR models, despite sometimes grandiose marketing claims from AI vendors.

What sets simulated reasoning models apart from traditional large language models (LLMs) is that they have been trained to output a step-by-step "thinking" process (often called "chain-of-thought") to solve problems. Note that "simulated" in this case doesn't mean that the models do not reason at all but rather that they do not necessarily reason using the same techniques as humans. That distinction is important because human reasoning itself is difficult to define.

Ars Technica
Researchers concerned to find AI models misrepresenting their “reasoning” processes 10 April 2025 at 22:37

Researchers concerned to find AI models misrepresenting their “reasoning” processes

Ars Technica

10 April 2025 at 22:37

Remember when teachers demanded that you "show your work" in school? Some new types of AI models promise to do exactly that, but new research suggests that the "work" they show can sometimes be misleading or disconnected from the actual process used to reach the answer.

New research from Anthropic—creator of the ChatGPT-like Claude AI assistant—examines simulated reasoning (SR) models like DeepSeek's R1, and its own Claude series. In a research paper posted last week, Anthropic's Alignment Science team demonstrated that these SR models frequently fail to disclose when they've used external help or taken shortcuts, despite features designed to show their "reasoning" process.

(It's worth noting that OpenAI's o1 and o3 series SR models were excluded from this study.)