National Science Foundation (NSF)

Auto Added by WPeMatico

New method could increase LLM training efficiency

Reasoning large language models (LLMs) are designed to solve complex problems by breaking them down into a series of smaller steps. These powerful models are particularly good at challenging tasks like advanced programming and multistep planning.But developing reasoning models demands an enormous amount of computation and energy due to inefficiencies in the training process. While […]

New method could increase LLM training efficiency Read More »

Parking-aware navigation system could prevent frustration and emissions

It happens every day — a motorist heading across town checks a navigation app to see how long the trip will take, but they find no parking spots available when they reach their destination. By the time they finally park and walk to their destination, they’re significantly later than they expected to be.Most popular navigation

Parking-aware navigation system could prevent frustration and emissions Read More »

Study: Platforms that rank the latest LLMs can be unreliable

A firm that wants to use a large language model (LLM) to summarize sales reports or triage customer inquiries can choose between hundreds of unique LLMs with dozens of model variations, each with slightly different performance.To narrow down the choice, companies often rely on LLM ranking platforms, which gather user feedback on model interactions to

Study: Platforms that rank the latest LLMs can be unreliable Read More »

How generative AI can help scientists synthesize complex materials

Generative artificial intelligence models have been used to create enormous libraries of theoretical materials that could help solve all kinds of problems. Now, scientists just have to figure out how to make them.In many cases, materials synthesis is not as simple as following a recipe in the kitchen. Factors like the temperature and length of

How generative AI can help scientists synthesize complex materials Read More »

MIT scientists investigate memorization risk in the age of clinical AI

What is patient privacy for? The Hippocratic Oath, thought to be one of the earliest and most widely known medical ethics texts in the world, reads: “Whatever I see or hear in the lives of my patients, whether in connection with my professional practice or not, which ought not to be spoken of outside, I

MIT scientists investigate memorization risk in the age of clinical AI Read More »

Guided learning lets “untrainable” neural networks realize their potential

Even networks long considered “untrainable” can learn effectively with a bit of a helping hand. Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have shown that a brief period of alignment between neural networks, a method they call guidance, can dramatically improve the performance of architectures previously thought unsuitable for modern tasks.Their findings

Guided learning lets “untrainable” neural networks realize their potential Read More »

Enabling small language models to solve complex reasoning tasks

As language models (LMs) improve at tasks like image generation, trivia questions, and simple math, you might think that human-like reasoning is around the corner. In reality, they still trail us by a wide margin on complex tasks. Try playing Sudoku with one, for instance, where you fill in numbers one through nine in such

Enabling small language models to solve complex reasoning tasks Read More »

New method improves the reliability of statistical estimations

Let’s say an environmental scientist is studying whether exposure to air pollution is associated with lower birth weights in a particular county.They might train a machine-learning model to estimate the magnitude of this association, since machine-learning methods are especially good at learning complex relationships.Standard machine-learning methods excel at making predictions and sometimes provide uncertainties, like

New method improves the reliability of statistical estimations Read More »

MIT engineers design an aerial microrobot that can fly as fast as a bumblebee

In the future, tiny flying robots could be deployed to aid in the search for survivors trapped beneath the rubble after a devastating earthquake. Like real insects, these robots could flit through tight spaces larger robots can’t reach, while simultaneously dodging stationary obstacles and pieces of falling rubble.So far, aerial microrobots have only been able

MIT engineers design an aerial microrobot that can fly as fast as a bumblebee Read More »

Researchers discover a shortcoming that makes LLMs less reliable

Large language models (LLMs) sometimes learn the wrong lessons, according to an MIT study.Rather than answering a query based on domain knowledge, an LLM could respond by leveraging grammatical patterns it learned during training. This can cause a model to fail unexpectedly when deployed on new tasks.The researchers found that models can mistakenly link certain

Researchers discover a shortcoming that makes LLMs less reliable Read More »