Machine Learning

Auto Added by WPeMatico

Why it’s critical to move beyond overly aggregated machine-learning metrics

ai, AI (Artificial Intelligence), Artificial Intelligence, Computer Science and Artificial Intelligence Laboratory (CSAIL), Computer science and technology, Electrical engineering and computer science (EECS), Health care, Institute for Medical Engineering and Science (IMES), Laboratory for Information and Decision Systems (LIDS), Machine Learning, MIT Schwarzman College of Computing, Research, School of Engineering, Technology and society

MIT researchers have identified significant examples of machine-learning model failure when those models are applied to data other than what they were trained on, raising questions about the need to test whenever a model is deployed in a new setting.“We demonstrate that even when you train models on large amounts of data, and choose the […]

Why it’s critical to move beyond overly aggregated machine-learning metrics Read More »

Microsoft Research Releases OptiMind: A 20B Parameter Model that Turns Natural Language into Solver Ready Optimization Models

ai, AI (Artificial Intelligence), AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Large Language Model, Machine Learning, New Releases, Open Source, Staff, Tech News, Technology

Microsoft Research has released OptiMind, an AI based system that converts natural language descriptions of complex decision problems into mathematical formulations that optimization solvers can execute. It targets a long standing bottleneck in operations research, where translating business intent into mixed integer linear programs usually needs expert modelers and days of work. What OptiMind Is

Microsoft Research Releases OptiMind: A 20B Parameter Model that Turns Natural Language into Solver Ready Optimization Models Read More »

A Coding Guide to Understanding How Retries Trigger Failure Cascades in RPC and Event-Driven Architectures

ai, AI (Artificial Intelligence), Artificial Intelligence, Editors Pick, Machine Learning, Staff, Technology, Tutorials

In this tutorial, we build a hands-on comparison between a synchronous RPC-based system and an asynchronous event-driven architecture to understand how real distributed systems behave under load and failure. We simulate downstream services with variable latency, overload conditions, and transient errors, and then drive both architectures using bursty traffic patterns. By observing metrics such as

A Coding Guide to Understanding How Retries Trigger Failure Cascades in RPC and Event-Driven Architectures Read More »

OpenAI to test ads in ChatGPT as it burns through billions

Advertising, ai, AI (Artificial Intelligence), AI assistants, AI bubble, Artificial Intelligence, Biz & IT, chatbots, ChatGPT, chatgtp, Fidji Simo, Generative AI, Google, Machine Learning, OpenAI, Sam Altman

On Friday, OpenAI announced it will begin testing advertisements inside the ChatGPT app for some US users in a bid to expand its customer base and diversify revenue. The move represents a reversal for CEO Sam Altman, who in 2024 described advertising in ChatGPT as a “last resort” and expressed concerns that ads could erode

OpenAI to test ads in ChatGPT as it burns through billions Read More »

TSMC says AI demand is “endless” after record Q4 earnings

ai, AI (Artificial Intelligence), AI chips, AI Infrastructure, Amazon, arizona, Artificial Intelligence, Biz & IT, C.C. Wei, datacenters, Google, Machine Learning, Microsoft, nvidia, semiconductors, Taiwan, TSMC

On Thursday, Taiwan Semiconductor Manufacturing Company (TSMC) reported record fourth-quarter earnings and said it expects AI chip demand to continue for years. During an earnings call, CEO C.C. Wei told investors that while he cannot predict the semiconductor industry’s long-term trajectory, he remains bullish on AI. TSMC manufactures chips for companies including Apple, Nvidia, AMD,

TSMC says AI demand is “endless” after record Q4 earnings Read More »

Google AI Releases TranslateGemma: A New Family of Open Translation Models Built on Gemma 3 with Support for 55 Languages

agentic ai, ai, AI (Artificial Intelligence), AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Audio Language Model, Editors Pick, Language Model, Large Language Model, Machine Learning, New Releases, Open Source, Staff, Tech News, Technology, TTS

Google AI has released TranslateGemma, a suite of open machine translation models built on Gemma 3 and targeted at 55 languages. The family comes in 4B, 12B and 27B parameter sizes. It is designed to run across devices from mobile and edge hardware to laptops and a single H100 GPU or TPU instance in the

Google AI Releases TranslateGemma: A New Family of Open Translation Models Built on Gemma 3 with Support for 55 Languages Read More »

NVIDIA AI Open-Sourced KVzap: A SOTA KV Cache Pruning Method that Delivers near-Lossless 2x-4x Compression

ai, AI (Artificial Intelligence), AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Machine Learning, New Releases, Open Source, Staff, Tech News, Technology

As context lengths move into tens and hundreds of thousands of tokens, the key value cache in transformer decoders becomes a primary deployment bottleneck. The cache stores keys and values for every layer and head with shape (2, L, H, T, D). For a vanilla transformer such as Llama1-65B, the cache reaches about 335 GB

NVIDIA AI Open-Sourced KVzap: A SOTA KV Cache Pruning Method that Delivers near-Lossless 2x-4x Compression Read More »

Wikipedia signs AI training deals with Microsoft, Meta, and Amazon

ai, AI (Artificial Intelligence), AI Infrastructure, ai training data, Amazon, Artificial Intelligence, Biz & IT, Generative AI, Google, jimmy wales, Large Language Models, Machine Learning, Meta, Microsoft, Mistral AI, non-profit, Perplexity, Wikimedia Enterprise, Wikimedia Foundation, Wikipedia

On Thursday, the Wikimedia Foundation announced licensing deals with Microsoft, Meta, Amazon, Perplexity, and Mistral AI, expanding its effort to charge major tech companies for using Wikipedia content to train the AI models that power AI assistants like Microsoft Copilot and OpenAI’s ChatGPT. While these same companies previously scraped Wikipedia without permission, the deals mean

Wikipedia signs AI training deals with Microsoft, Meta, and Amazon Read More »

Generative AI tool helps 3D print personal items that sustain daily use

3-D printing, ai, AI (Artificial Intelligence), Artificial Intelligence, Center for Bits and Atoms, Computer Science and Artificial Intelligence Laboratory (CSAIL), Computer science and technology, Design, Electrical engineering and computer science (EECS), Fabrication, Invention, Machine Learning, MIT Schwarzman College of Computing, School of Engineering

Generative artificial intelligence models have left such an indelible impact on digital content creation that it’s getting harder to recall what the internet was like before it. You can call on these AI tools for clever projects such as videos and photos — but their flair for the creative hasn’t quite crossed over into the

Generative AI tool helps 3D print personal items that sustain daily use Read More »