Nvidia banking on TensorRT to expand generative AI dominance – Warungku Teknologi
Nvidia looks to build a bigger presence outside GPU sales as it puts its AI-specific software development kit into more applications.
Nvidia announced that it’s adding support for its TensorRT-LLM SDK to Windows and models like Stable Diffusion. The company said in a blog post that it aims to make large language models (LLMs) and related tools run faster.
TensorRT speeds up inference, the process of going through pretrained information and calculating probabilities to come up with a result — like a newly generated Stable Diffusion image. With this software, Nvidia wants to play a bigger part in the inference side of generative AI.
Its TensorRT-LLM breaks down LLMs and lets them run faster on Nvidia’s H100 GPUs. It works with LLMs like Meta’s Llama 2 and other AI models like Stability AI’s Stable Diffusion. The company said by running LLMs through TensorRT-LLM, “this acceleration significantly improves the experience for more sophisticated LLM use — like writing and coding assistants.”
In other words, Nvidia hopes that it will not only provide the GPUs that train and run LLMs but also provide the software that allows models to run and work faster so users don’t seek other ways to make generative AI cost-efficient.
The company said TensorRT-LLM will be “available publicly to anyone who wants to use or integrate it” and can access the SDK on its site.
Nvidia already has a near monopoly on the powerful chips that train LLMs like GPT-4 — and to train and run one, you typically need a lot of GPUs. Demand has skyrocketed for its H100 GPUs; estimated prices have reached $40,000 per chip. The company announced a newer version of its GPU, the GH200, coming next year. No wonder Nvidia’s revenues increased to $13.5 billion in the second quarter.
But the world of generative AI moves fast, and new methods to run LLMs without needing a lot of expensive GPUs have come out. Companies like Microsoft and AMD announced they’ll make their own chips to lessen the reliance on Nvidia.
And companies have set their sights on the inference side of AI development. AMD plans to buy software company Nod.ai to help LLMs specifically run on AMD chips, while companies like SambaNova already offer services that make it easier to run models as well.
Nvidia, for now, remains the hardware leader in generative AI, but it already looks like it’s angling for a future where people don’t have to depend on buying huge numbers of its GPUs.
Info Teknologi Terbaru
laptop asus terbaru, laptop hp terbaru, laptop lenovo terbaru, laptop acer terbaru, laptop asus terbaru 2021, laptop terbaru 2021, harga laptop terbaru, laptop terbaru 2022, laptop terbaru, laptop samsung terbaru, laptop dell terbaru, asus laptop terbaru, laptop lenovo terbaru 2021, harga laptop hp terbaru, harga laptop asus terbaru, laptop toshiba terbaru, harga laptop terbaru 2021, harga laptop lenovo terbaru, laptop xiaomi terbaru, laptop hp terbaru 2021, laptop asus terbaru 2022, harga laptop acer terbaru, laptop lenovo terbaru 3 jutaan, harga laptop asus terbaru 2021, laptop terbaru 2021 dan harganya, laptop samsung terbaru 2021, lenovo laptop terbaru, laptop asus terbaru 2021 harga 5 jutaan, laptop apple terbaru, laptop acer terbaru 2021 dan harganya, laptop asus terbaru warna pink, laptop rog terbaru, laptop asus terbaru 2021 dan harganya, laptop terbaru 2022 dan harganya, laptop dell terbaru 2021, harga laptop terbaru 2022 dan spesifikasinya, laptop hp terbaru 2022, laptop keluaran terbaru, harga laptop lenovo terbaru 2021, laptop hp terbaru 2021 dan harganya, asus terbaru laptop, laptop acer terbaru 2021, laptop asus terbaru 2020 harga 5 jutaan, laptop hp terbaru tipis, harga laptop dell terbaru 2021, harga laptop acer terbaru 2021, laptop zyrex terbaru, laptop asus terbaru 2020, lenovo terbaru laptop, laptop keluaran terbaru 2021




