Computer vision


Salesforce AI Introduces FOFPred: A Language-Driven Future Optical Flow Prediction Framework that Enables Improved Robot Control and Video Generation

The Salesforce AI research team presents FOFPred, a language-driven future optical flow prediction framework that connects large vision-language models with diffusion transformers for dense motion forecasting in control and video generation settings. FOFPred takes one or more images and a natural language instruction such as ‘moving the bottle from right to left’ and predicts […]


Black Forest Labs Releases FLUX.2 [klein]: Compact Flow Models for Interactive Visual Intelligence

Black Forest Labs releases FLUX.2 [klein], a compact image model family that targets interactive visual intelligence on consumer hardware. FLUX.2 [klein] extends the FLUX.2 line with sub-second generation and editing, a unified architecture for text-to-image and image-to-image, and deployment options that range from local GPUs to cloud APIs, while keeping


From cloud to factory – humanoid robots coming to workplaces

The partnership announced this week between Microsoft and Hexagon Robotics marks an inflection point in the commercialisation of humanoid, AI-powered robots for industrial environments. The two companies will combine Microsoft’s cloud and AI infrastructure with Hexagon’s expertise in robotics, sensors, and spatial intelligence to advance the deployment of physical AI systems in real-world settings. At


30 Best Data Science Books to Read in 2026

Data science powers decision-making across modern businesses, from data preparation and automation to advanced analytics and machine learning. Learning it requires a strong foundation in mathematics, statistics, programming, and practical problem-solving. The good news is that data science can be self-learned with the right resources and consistent practice. Books remain one of the most effective



Multilingual Sentiment Analysis – Importance, Methodology, and Challenges

The internet has become a massive, always-on focus group. Customers share opinions in product reviews, app store comments, support chats, social media posts, and community forums—often switching between languages and dialects in a single conversation. If you only analyze English, you’re ignoring a huge portion of what your customers actually feel. Recent estimates suggest roughly


A “scientific sandbox” lets researchers explore the evolution of vision systems

Why did humans evolve the eyes we have today? While scientists can’t go back in time to study the environmental pressures that shaped the evolution of the diverse vision systems that exist in nature, a new computational framework developed by MIT researchers allows them to explore this evolution in artificial intelligence agents. The framework they developed, in


Thinking Machines Lab Makes Tinker Generally Available: Adds Kimi K2 Thinking And Qwen3-VL Vision Input

Thinking Machines Lab has moved its Tinker training API into general availability and added three major capabilities: support for the Kimi K2 Thinking reasoning model, OpenAI-compatible sampling, and image input through Qwen3-VL vision language models. For AI engineers, this turns Tinker into a practical way to fine-tune frontier models without building distributed training


Deep-learning model predicts how fruit flies form, cell by cell

During early development, tissues and organs begin to bloom through the shifting, splitting, and growing of many thousands of cells. A team of MIT engineers has now developed a way to predict, minute by minute, how individual cells will fold, divide, and rearrange during a fruit fly’s earliest stage of growth. The new method may one


Zhipu AI Releases GLM-4.6V: A 128K Context Vision Language Model with Native Tool Calling

Zhipu AI has open-sourced the GLM-4.6V series as a pair of vision language models that treat images, video and tools as first-class inputs for agents, not as afterthoughts bolted on top of text. Model lineup and context length: the series has two models. GLM-4.6V is a 106B parameter foundation model for cloud and
