Researchers at Tsinghua University and Z.ai built IndexCache to eliminate redundant computation in sparse attention models ...
Micron Technology shares were on pace to snap a six-session losing streak Friday, with an analyst likening the recent market freakout over memory stocks to last winter's DeepSeek saga that ultimately ...
The 10-day selloff reflects investor concern that one of the industry's most powerful tailwinds may not prove as strong as ...
The biggest memory burden for LLMs is the key-value cache, which stores conversational context as users interact with AI ...
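To make the memory burden concrete, here is a minimal sketch of a per-layer key-value cache that grows by one slot per generated token. The model sizes (`num_layers`, `num_heads`, `head_dim`) are hypothetical, chosen only to illustrate how cache memory scales linearly with context length; this is not any particular framework's implementation.

```python
import numpy as np

# Toy model dimensions (illustrative, not from any real model).
num_layers, num_heads, head_dim = 2, 4, 8

# One (keys, values) buffer per layer; grows by one slot per token.
kv_cache = [{"k": [], "v": []} for _ in range(num_layers)]

def append_token(cache, layer, k, v):
    """Append this token's key/value vectors for one layer."""
    cache[layer]["k"].append(k)
    cache[layer]["v"].append(v)

# Simulate caching 16 tokens of conversational context in fp16.
for _ in range(16):
    for layer in range(num_layers):
        append_token(kv_cache, layer,
                     np.zeros((num_heads, head_dim), dtype=np.float16),
                     np.zeros((num_heads, head_dim), dtype=np.float16))

# Total cache bytes: 16 tokens * 2 layers * 2 tensors * (4*8*2) bytes.
bytes_used = sum(a.nbytes for layer in kv_cache
                 for a in layer["k"] + layer["v"])
print(bytes_used)  # -> 4096
```

Every additional token of context adds the same fixed number of bytes per layer, which is why long conversations and long-context applications are dominated by cache memory rather than model weights.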
Bloomberg journalists discuss today's biggest winners and losers in the stock market. Listen for analysis on the companies ...
The technique aims to ease GPU memory constraints that limit how enterprises scale AI inference and long-context applications ...
Google's TurboQuant algorithm compresses LLM key-value caches to 3 bits with no accuracy loss. Memory stocks fell within ...
Forget the parameter race. Google's TurboQuant research compresses AI memory by 6x with zero accuracy loss. It's not ...
Google unveils TurboQuant, PolarQuant and more to cut LLM/vector search memory use, pressuring MU, WDC, STX & SNDK.
Google Research recently revealed TurboQuant, a compression algorithm that reduces the memory footprint of large language ...
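The articles above do not describe TurboQuant's internals, so the following is a generic 3-bit uniform quantization sketch, not Google's actual algorithm: it only shows the memory arithmetic behind compressing fp16 cache entries to 3-bit codes. All names and the random test tensor are illustrative.

```python
import numpy as np

# NOT TurboQuant -- a plain uniform quantizer used only to illustrate
# what "3 bits per value" means for a KV-cache-like tensor.
rng = np.random.default_rng(0)
x = rng.standard_normal(1024).astype(np.float32)

levels = 2 ** 3            # 3 bits -> 8 quantization levels
lo, hi = x.min(), x.max()
scale = (hi - lo) / (levels - 1)

# Quantize: map each float to the nearest of 8 evenly spaced codes.
codes = np.clip(np.round((x - lo) / scale), 0, levels - 1).astype(np.uint8)
x_hat = codes * scale + lo  # dequantize

# Naive ratio vs. fp16 storage: 16 bits / 3 bits ~= 5.3x. The ~6x figure
# reported for TurboQuant presumably reflects its own encoding details.
compression = 16 / 3
max_err = np.abs(x - x_hat).max()
print(round(compression, 1), bool(max_err <= scale))
```

A naive uniform quantizer like this loses accuracy at 3 bits; the reported result, that TurboQuant reaches this bit width with no accuracy loss, is precisely what distinguishes it from the baseline shown here.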