Laboratory for Information and Decision Systems (LIDS)

Auto Added by WPeMatico

Study: Platforms that rank the latest LLMs can be unreliable

A firm that wants to use a large language model (LLM) to summarize sales reports or triage customer inquiries can choose between hundreds of unique LLMs with dozens of model variations, each with slightly different performance.To narrow down the choice, companies often rely on LLM ranking platforms, which gather user feedback on model interactions to […]

Study: Platforms that rank the latest LLMs can be unreliable Read More »

Why it’s critical to move beyond overly aggregated machine-learning metrics

MIT researchers have identified significant examples of machine-learning model failure when those models are applied to data other than what they were trained on, raising questions about the need to test whenever a model is deployed in a new setting.“We demonstrate that even when you train models on large amounts of data, and choose the

Why it’s critical to move beyond overly aggregated machine-learning metrics Read More »

3 Questions: How AI could optimize the power grid

Artificial intelligence has captured headlines recently for its rapidly growing energy demands, and particularly the surging electricity usage of data centers that enable the training and deployment of the latest generative AI models. But it’s not all bad news — some AI tools have the potential to reduce some forms of energy consumption and enable cleaner grids.One

3 Questions: How AI could optimize the power grid Read More »

MIT scientists investigate memorization risk in the age of clinical AI

What is patient privacy for? The Hippocratic Oath, thought to be one of the earliest and most widely known medical ethics texts in the world, reads: “Whatever I see or hear in the lives of my patients, whether in connection with my professional practice or not, which ought not to be spoken of outside, I

MIT scientists investigate memorization risk in the age of clinical AI Read More »

New method improves the reliability of statistical estimations

Let’s say an environmental scientist is studying whether exposure to air pollution is associated with lower birth weights in a particular county.They might train a machine-learning model to estimate the magnitude of this association, since machine-learning methods are especially good at learning complex relationships.Standard machine-learning methods excel at making predictions and sometimes provide uncertainties, like

New method improves the reliability of statistical estimations Read More »

A smarter way for large language models to think about hard problems

To make large language models (LLMs) more accurate when answering harder questions, researchers can let the model spend more time thinking about potential solutions.But common approaches that give LLMs this capability set a fixed computational budget for every problem, regardless of how complex it is. This means the LLM might waste computational resources on simpler questions

A smarter way for large language models to think about hard problems Read More »

MIT engineers design an aerial microrobot that can fly as fast as a bumblebee

In the future, tiny flying robots could be deployed to aid in the search for survivors trapped beneath the rubble after a devastating earthquake. Like real insects, these robots could flit through tight spaces larger robots can’t reach, while simultaneously dodging stationary obstacles and pieces of falling rubble.So far, aerial microrobots have only been able

MIT engineers design an aerial microrobot that can fly as fast as a bumblebee Read More »

New control system teaches soft robots the art of staying safe

Imagine having a continuum soft robotic arm bend around a bunch of grapes or broccoli, adjusting its grip in real time as it lifts the object. Unlike traditional rigid robots that generally aim to avoid contact with the environment as much as possible and stay far away from humans for safety reasons, this arm senses

New control system teaches soft robots the art of staying safe Read More »

Researchers discover a shortcoming that makes LLMs less reliable

Large language models (LLMs) sometimes learn the wrong lessons, according to an MIT study.Rather than answering a query based on domain knowledge, an LLM could respond by leveraging grammatical patterns it learned during training. This can cause a model to fail unexpectedly when deployed on new tasks.The researchers found that models can mistakenly link certain

Researchers discover a shortcoming that makes LLMs less reliable Read More »

A faster problem-solving tool that guarantees feasibility

Managing a power grid is like trying to solve an enormous puzzle.Grid operators must ensure the proper amount of power is flowing to the right areas at the exact time when it is needed, and they must do this in a way that minimizes costs without overloading physical infrastructure. Even more, they must solve this

A faster problem-solving tool that guarantees feasibility Read More »