Laboratory for Information and Decision Systems (LIDS)

A better method for identifying overconfident large language models

Large language models (LLMs) can generate credible but inaccurate responses, so researchers have developed uncertainty quantification methods to check the reliability of predictions. One popular method involves submitting the same prompt multiple times to see if the model generates the same answer. But this method measures self-confidence, and even the most impressive LLM might be confidently […]
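
As a rough illustration of the repeated-prompt check described above (not the researchers' new method, which the excerpt does not detail), the sketch below samples a model several times and reports how often the most common answer appears. The `ask_model` callable, the `self_consistency` function, and the canned `fake_model` stub are all hypothetical names introduced for this example; a real setup would replace the stub with a call to an actual LLM.

```python
from collections import Counter
from itertools import cycle
from typing import Callable, Tuple


def self_consistency(
    ask_model: Callable[[str], str],  # hypothetical: any function mapping a prompt to an answer
    prompt: str,
    n_samples: int = 5,
) -> Tuple[str, float]:
    """Ask the same prompt n_samples times and measure agreement.

    Returns the most frequent (normalized) answer and the fraction of
    samples that matched it. High agreement signals self-confidence only;
    it does not show that the answer is correct.
    """
    if n_samples < 1:
        raise ValueError("n_samples must be at least 1")
    # Normalize lightly so trivial differences in case or spacing do not count as disagreement.
    answers = [ask_model(prompt).strip().lower() for _ in range(n_samples)]
    top_answer, count = Counter(answers).most_common(1)[0]
    return top_answer, count / n_samples


# Stand-in for a real model: replays a fixed list of answers (assumption for the demo).
_canned = cycle(["Paris", "paris ", "Lyon", "Paris", "Paris"])


def fake_model(prompt: str) -> str:
    return next(_canned)


answer, agreement = self_consistency(fake_model, "What is the capital of France?", n_samples=5)
print(answer, agreement)
```

A model that returns the same wrong answer every time would score an agreement of 1.0 here, which is the limitation the excerpt points to.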

MIT-IBM Watson AI Lab seed to signal: Amplifying early-career faculty impact

The early years of faculty members’ careers are a formative and exciting time in which to establish a firm footing that helps determine the trajectory of researchers’ studies. This includes building a research team, which demands innovative ideas and direction, creative collaborators, and reliable resources. For a group of MIT faculty working with and on artificial

A better method for planning complex visual tasks

MIT researchers have developed a generative artificial intelligence-driven approach for planning long-term visual tasks, like robot navigation, that is about twice as effective as some existing techniques. Their method uses a specialized vision-language model to perceive the scenario in an image and simulate actions needed to reach a goal. Then a second model translates those simulations

AI to help researchers see the bigger picture in cell biology

Studying gene expression in a cancer patient’s cells can help clinical biologists understand the cancer’s origin and predict the success of different treatments. But cells are complex and contain many layers, so how the biologist conducts measurements affects which data they can obtain. For instance, measuring proteins in a cell could yield different information about the

Enhancing maritime cybersecurity with technology and policy

Originally from the small Balkan country of Montenegro, Strahinja (Strajo) Janjusevic says his life has unfolded in unexpected ways, for which he is deeply grateful. After graduating from high school, he was selected to represent his country in the United States, studying cyber operations and computer science at the U.S. Naval Academy in Annapolis, Maryland.

Parking-aware navigation system could prevent frustration and emissions

It happens every day: a motorist heading across town checks a navigation app to see how long the trip will take, but they find no parking spots available when they reach their destination. By the time they finally park and walk to their destination, they’re significantly later than they expected to be. Most popular navigation

Personalization features can make LLMs more agreeable

Many of the latest large language models (LLMs) are designed to remember details from past conversations or store user profiles, enabling these models to personalize responses. But researchers from MIT and Penn State University found that, over long conversations, such personalization features often increase the likelihood an LLM will become overly agreeable or begin mirroring the

Study: Platforms that rank the latest LLMs can be unreliable

A firm that wants to use a large language model (LLM) to summarize sales reports or triage customer inquiries can choose between hundreds of unique LLMs with dozens of model variations, each with slightly different performance. To narrow down the choice, companies often rely on LLM ranking platforms, which gather user feedback on model interactions to

Why it’s critical to move beyond overly aggregated machine-learning metrics

MIT researchers have identified significant examples of machine-learning model failure when those models are applied to data other than what they were trained on, raising questions about the need to test whenever a model is deployed in a new setting. “We demonstrate that even when you train models on large amounts of data, and choose the

3 Questions: How AI could optimize the power grid

Artificial intelligence has captured headlines recently for its rapidly growing energy demands, and particularly the surging electricity usage of data centers that enable the training and deployment of the latest generative AI models. But it’s not all bad news: some AI tools have the potential to reduce some forms of energy consumption and enable cleaner grids. One
