
From MIT Dorm Room To $60 Billion: Meet The Four Founders Who Built AI Startup ‘Cursor’ & Sold It To SpaceX – News18

ai, AI (Artificial Intelligence), Artificial Intelligence


Bengaluru Ranks Second in Asia’s AI-Native Startup Clusters – GK Today

ai, AI (Artificial Intelligence), Artificial Intelligence


Syke Founder Alistair Maiden Joins Flank

agentic ai, ai, AI (Artificial Intelligence), Artificial Intelligence

Alistair Maiden, who created the well-known legal engineering group Syke before selling it to Consilio, has joined legal agent maker Flank in a senior role. …


This place in India is defying the monsoon crisis with 96% excess rain

ai, AI (Artificial Intelligence), Artificial Intelligence


MiniMax Sparse Attention (MSA): a Two-Branch Block-Sparse Attention Trained on a 109B-Parameter MoE With a 3T-Token Budget

agentic ai, ai, AI (Artificial Intelligence), AI Infrastructure, AI Paper Summary, AI Shorts, Applications, Artificial Intelligence, Editors Pick, Language Model, Machine Learning, New Releases, Software engineering, Staff, Tech News, Technology

MiniMax released MSA (MiniMax Sparse Attention), a sparse attention method built directly on Grouped Query Attention (GQA). It targets one bottleneck: the quadratic cost of softmax attention at long context. The MiniMax research team tested it inside a 109B-parameter Mixture-of-Experts model trained with native multimodal data. They also open-sourced an inference kernel and shipped a

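To make the "quadratic cost" point in the MSA summary concrete, the sketch below counts how many query-key scores a causal dense attention layer computes versus a generic block-sparse layer that lets each query attend to a fixed number of key blocks. This is not MiniMax's algorithm: the function names, `block_size`, and `blocks_per_query` are hypothetical, and the selection rule (a simple per-query key budget) is an assumption chosen only to show how the count stops growing quadratically.

```python
def dense_score_count(seq_len: int) -> int:
    """Causal dense attention: the token at position i scores against keys 0..i."""
    return seq_len * (seq_len + 1) // 2


def block_sparse_score_count(seq_len: int, block_size: int, blocks_per_query: int) -> int:
    """Generic block-sparse budget (assumption, not the MSA selection rule).

    Each query scores against at most `blocks_per_query` key blocks of
    `block_size` tokens, and never against more keys than are causally visible.
    """
    budget = block_size * blocks_per_query  # max keys any single query may score
    return sum(min(i + 1, budget) for i in range(seq_len))


if __name__ == "__main__":
    # Illustrative sizes only; none of these numbers come from the article.
    for n in (1_000, 10_000, 100_000):
        dense = dense_score_count(n)
        sparse = block_sparse_score_count(n, block_size=64, blocks_per_query=16)
        print(f"seq_len={n}: dense={dense}, block_sparse={sparse}")
```

Under this assumption the dense count grows with the square of the sequence length, while the block-sparse count grows linearly once the sequence is longer than the per-query budget.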