Apple and NVIDIA partner to boost AI language models’ speed and efficiency

Apple has announced a new partnership with NVIDIA aimed at improving the performance of large language models (LLMs).

Apple and NVIDIA partner to boost AI language models' speed and efficiency
Apple and NVIDIA partner to boost AI language models' speed and efficiency

Apple has announced a new partnership with NVIDIA aimed at improving the performance of large language models (LLMs). This collaboration introduces a cutting-edge text generation technique that significantly speeds up AI applications, benefiting industries that rely on powerful language models.

Earlier this year, Apple unveiled a machine learning approach called Recurrent Drafter (ReDrafter). This technology combines two advanced methods: beam search and dynamic tree attention. Beam search works by evaluating multiple potential text sequences at once, allowing for more accurate and diverse results. Meanwhile, tree attention helps streamline these sequences, reducing redundant information and improving processing efficiency. The result is faster and more efficient text generation.

Now, Apple has integrated ReDrafter into NVIDIA’s TensorRT-LLM framework. TensorRT-LLM is a tool designed to optimize large language models running on NVIDIA’s powerful GPUs, which are often used for AI tasks. The integration of ReDrafter into this framework has yielded impressive results, with Apple reporting a 2.7x increase in the speed at which tokens (the basic units of text) are generated. This boost was achieved while running a production model containing tens of billions of parameters, which is a standard size for complex AI models.

The improvements brought by ReDrafter do more than just increase speed. By making the text generation process more efficient, the technology also reduces GPU usage and lowers power consumption, which is particularly valuable for large-scale AI applications. This means that developers can now create AI applications that are faster, more energy-efficient, and more cost-effective to run.

Apple highlighted that the integration of ReDrafter into NVIDIA’s framework is a big step forward for LLMs, as these models are becoming an essential part of many production applications. Reducing computational costs and latency is crucial for businesses relying on these models, especially in real-time services where speed is critical.

Developers interested in using ReDrafter to enhance their own AI projects can access detailed documentation on both Apple’s and NVIDIA’s websites. The collaboration between these two tech giants promises to further accelerate the development of AI-powered applications and make them more efficient than ever before.

Get live Share Market updates, Stock Market Quotes, and the latest India News and business news on Financial Express. Download the Financial Express App for the latest finance news.

This article was first uploaded on December twenty, twenty twenty-four, at fourteen minutes past five in the evening.

Photo Gallery

View All
Market Data
Market Data