Samsung shrinks 30B AI model to fit in your phone

Samsung Research has developed new model compression technology capable of running a 30-billion-parameter AI model on a mobile device using less than 3GB of memory, a significant reduction from the typical 16GB requirement. Dr. MyungJoo Ham of the Samsung Research AI Center explained that the process involves a sophisticated quantization technique, which compresses data like high-efficiency photo compression while preserving performance.

The system identifies and preserves the most critical neural network weights with higher precision, while more aggressively compressing less vital components. This is managed by an AI runtime engine that functions as a control unit, intelligently directing computational tasks to the optimal processor and minimizing memory access to maximize efficiency.

This advancement in on-device AI processing addresses key limitations of cloud dependency, including battery drain, heat generation, and network latency. It also enhances data privacy by keeping user information locally stored. The technology is intended to enable more sophisticated and responsive AI applications directly on personal devices.
 

Attachments

  • Samsung shrinks 30B AI model to fit in your phone.webp
    Samsung shrinks 30B AI model to fit in your phone.webp
    66.1 KB · Views: 35

Trending content

Sponsored

Top