FuriosaAI, a South Korean AI chip startup, celebrates a significant achievement as its energy-efficient chips have been adopted by LG’s AI Research for enhancing their AI accelerator servers. Despite the industry’s typical focus on pure power and speed, it’s the power efficiency of Furiosa’s RNGD (pronounced ‘renegade’) inference accelerators that caught LG’s eye.
Kijeong Jeon from LG AI Research emphasized their decision was influenced by RNGD’s balance of performance and integration simplicity, leading to cost-effective outcomes. RNGD’s modest technical specifications, including 256 to 512 teraFLOPS and 48GB of HBM3 memory, might seem unremarkable against competitors, yet its 180-watt consumption offers exceptional power efficiency, outperforming older Nvidia A100 GPUs by up to 2.25x.
While competition with newer Nvidia models remains challenging in raw performance, FuriosaAI CEO June Paik highlights RNGD’s efficiency due to its unique Tensor Contraction Processor architecture minimizing data movement and instruction use. RNGD’s efficiency, with about 1.4 teraFLOPS per watt, puts it closer to Nvidia’s more recent innovations rather than their older models.
The real-world performance was demonstrated in LG’s Exaone 32B model tests using RNGD cards, meeting stringent performance benchmarks and setting new standards for token generation speeds. However, Furiosa still faces hurdles, notably Nvidia and AMD’s advanced GPUs, offering broader performance and parallelism.
Furiosa’s early architectural choices, like using HBM over GDDR in anticipation of evolving AI model demands, showcase a forward-thinking strategy that positions it competitively in energy efficiency. As the market demands more native solutions, including enterprise-ready models from LG with RNGD support, FuriosaAI’s commitment to sovereignty in AI technology shines through.