Research

Auto Added by WPeMatico

OpenAI o1 System Card

This report outlines the safety work carried out prior to releasing OpenAI o1 and o1-mini, including external red teaming and frontier risk evaluations according to our Preparedness Framework.

OpenAI o1 System Card Read More »

Advancing red teaming with people and AI

Artificial Intelligence, Research

Advancing red teaming with people and AI

Advancing red teaming with people and AI Read More »

Introducing SimpleQA

Artificial Intelligence, Research

A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.

Introducing SimpleQA Read More »

Simplifying, stabilizing, and scaling continuous-time consistency models

Artificial Intelligence, Research

We’ve simplified, stabilized, and scaled continuous-time consistency models, achieving comparable sample quality to leading diffusion models, while using only two sampling steps.

Simplifying, stabilizing, and scaling continuous-time consistency models Read More »

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Artificial Intelligence, Research

We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering Read More »

Evaluating fairness in ChatGPT

Artificial Intelligence, Research

We’ve analyzed how ChatGPT responds to users based on their name, using AI research assistants to protect privacy.

Evaluating fairness in ChatGPT Read More »

Adding Project Specific Instructions via the Projects App

Introducing Translator Copilot: Bridging Customers and Translators with AI

2023, AI, Automation & Tech, Artificial Intelligence, Blog, Language, Large Language Model, LLM, Quality, Research, Translation Quality

Translator Copilot is Unbabel’s new AI assistant built directly into our CAT tool. It leverages large language models (LLMs) and Unbabel’s proprietary Quality Estimation (QE) technology to act as a smart second pair of eyes for every translation. From checking whether customer instructions are followed to flagging potential errors in real time, Translator Copilot strengthens

Introducing Translator Copilot: Bridging Customers and Translators with AI Read More »

Announcing Tower: An Open Multilingual LLM for Translation-Related Tasks

2023, AI, Automation & Tech, Artificial Intelligence, Blog, Language, Large Language Model, Localization & Translation, NLP and MT, Quality, Research, Translation, Translation Quality

Updated February 9, 2024 to include the newest iteration of Tower models. We are thrilled to announce the release of Tower, a suite of multilingual large language models (LLM) optimized for translation-related tasks. Tower is built on top of LLaMA2 [1], comes in two sizes — 7B and 13B parameters —, and currently supports 10

Announcing Tower: An Open Multilingual LLM for Translation-Related Tasks Read More »

Sequence Feature Extraction for Malware Family Analysis via Graph Neural Network

Artificial Intelligence, arXiv, Ciberseguridad | 🇬🇧 Cybersecurity, Creative Commons, cybersecurity, Cybersecurity (Paper), D. Learning, English, English (Paper), Inside The wall, InsidetheRss, Investigación | 🇬🇧 Research, news, Numbered, Open Access, Papers, Programación y frameworks en IA | 🇬🇧 Programming and frameworks in AI, Research

Malicious software (malware) causes much harm to our devices and life. We are eager to understand the malware behavior and the threat it made. Most of the record files of malware are variable length and text-based files with time stamps, such as event log data and dynamic analysis profiles. Using the time stamps, we can

Sequence Feature Extraction for Malware Family Analysis via Graph Neural Network Read More »

Scientific Visualization: Python + Matplotlib

#Programmer, articles, Artificial Intelligence, by-nc-sa, Ciencia de Datos | 🇬🇧 Data Science, Creative Commons, Ebooks, English (Articles), English (Ebook), Inside The wall, InsidetheRss, Investigación | 🇬🇧 Research, news, Numbered, Open Access, Programación y frameworks en IA | 🇬🇧 Programming and frameworks in AI, Programming (Article), Python, Python libraries, Research

The Python scientific visualisation landscape is huge. It is composed of a myriad of tools, ranging from the most versatile and widely used down to the more specialised and confidential. Some of these tools are community based while others are developed by companies. Some are made specifically for the web, others are for the desktop

Scientific Visualization: Python + Matplotlib Read More »