Intel's Lunar Lake 8-Core CPUs have been spotted running in Samsung's next-gen Galaxy Book5 Pro laptops with Arc Battlemage "Xe2" iGPUs.
12.03.2024 - 07:43 / wccftech.com / Hassan Mujtaba
Stability AI has published a new blog post benchmarking Intel's Gaudi 2 against NVIDIA's H100 and A100 GPU accelerators in AI workloads. The results suggest that Intel's accelerator offers strong value and is a credible alternative for customers who want a fast, readily available option in place of NVIDIA's offerings.
Stability AI builds open models that handle a diverse range of tasks. For this comparison, the company ran two of its models, including Stable Diffusion 3, on the most popular AI accelerators from NVIDIA and Intel to see how they perform against each other.
In Stable Diffusion 3, the next version of the highly popular text-to-image model, Intel's Gaudi 2 AI accelerator delivered some exceptional results. The model ranges from 800M to 8B parameters, and the 2B parameter version was used for testing. For the comparison, 2 nodes with a total of 16 accelerators were used for each platform, with the batch size held at 16 per accelerator (256 in total). The end result was the Intel Gaudi 2 offering a 56% speedup versus the H100 80 GB GPU and a 2.43x speedup versus the A100 80 GB GPU.
The 96 GB HBM capacity also allowed Intel's Gaudi 2 to fit a batch size of 32 per accelerator, for a total batch size of 512. This raised throughput to 1,254 images per second, a speedup of 35% over the Gaudi 2 at batch size 16, 2.10x over the H100 80 GB, and 3.26x over the A100 80 GB.
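The batch sizes and speedups quoted above can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function names are made up for this example, and the baseline throughputs it derives are implied by the quoted ratios rather than reported figures.

```python
# Figures quoted in the article for the 2-node (16-accelerator) run.
ACCELERATORS = 16
GAUDI2_BATCH32_IMAGES_PER_SEC = 1254.0  # Gaudi 2 at batch size 32 per accelerator


def global_batch(per_accelerator_batch, accelerators=ACCELERATORS):
    """Total batch size across all accelerators in the run."""
    return per_accelerator_batch * accelerators


def implied_baseline(throughput, speedup):
    """Throughput of the slower system implied by a quoted speedup factor."""
    return throughput / speedup


# Baselines implied by the quoted "35%", "2.10x" and "3.26x" speedups.
# These are derived values, not numbers reported in the article.
gaudi2_batch16 = implied_baseline(GAUDI2_BATCH32_IMAGES_PER_SEC, 1.35)
h100_batch16 = implied_baseline(GAUDI2_BATCH32_IMAGES_PER_SEC, 2.10)
a100_batch16 = implied_baseline(GAUDI2_BATCH32_IMAGES_PER_SEC, 3.26)

if __name__ == "__main__":
    print("global batch at 16 per accelerator:", global_batch(16))
    print("global batch at 32 per accelerator:", global_batch(32))
    print("implied Gaudi 2 (batch 16) images/s:", round(gaudi2_batch16, 1))
    print("implied Gaudi 2 vs H100 ratio:", round(gaudi2_batch16 / h100_batch16, 2))
    print("implied Gaudi 2 vs A100 ratio:", round(gaudi2_batch16 / a100_batch16, 2))
```

The implied batch-16 ratios land close to the 56% and 2.43x figures quoted for the first run, so the two sets of numbers are consistent with each other to within rounding.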
Scaling further to 32 nodes (256 accelerators) for both the Gaudi 2 and the A100 80 GB, the Intel solution shows a 3.16x advantage, delivering 49.4 images / second / device versus just 15.6 on the A100 solution.
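The per-device figures for the 32-node run can be turned into a ratio and a rough cluster-wide estimate. This is a minimal sketch: the helper names are hypothetical, and the aggregate numbers assume every one of the 256 devices sustains the quoted per-device rate, which the article does not state.

```python
# Per-device throughput quoted for the 32-node (256-accelerator) run.
GAUDI2_IMAGES_PER_SEC_PER_DEVICE = 49.4
A100_IMAGES_PER_SEC_PER_DEVICE = 15.6
DEVICES = 256


def speedup(fast, slow):
    """Ratio of two throughput figures."""
    return fast / slow


def cluster_throughput(per_device, devices=DEVICES):
    """Aggregate images/s, ASSUMING every device sustains the per-device rate."""
    return per_device * devices


ratio = speedup(GAUDI2_IMAGES_PER_SEC_PER_DEVICE, A100_IMAGES_PER_SEC_PER_DEVICE)
gaudi2_cluster = cluster_throughput(GAUDI2_IMAGES_PER_SEC_PER_DEVICE)
a100_cluster = cluster_throughput(A100_IMAGES_PER_SEC_PER_DEVICE)

if __name__ == "__main__":
    print("Gaudi 2 vs A100 per-device ratio:", ratio)
    print("estimated Gaudi 2 cluster images/s:", gaudi2_cluster)
    print("estimated A100 cluster images/s:", a100_cluster)
```

The computed ratio is about 3.17, which matches the quoted 3.16x if that figure was truncated rather than rounded.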
While training performance is superb on the Gaudi 2 AI accelerators, NVIDIA appears to retain the lead in inference thanks to its TensorRT optimizations, which have made huge progress over the past year, and the green team continues to make great strides in this ecosystem. The A100 GPUs are said to produce images up to 40% faster than the Gaudi 2 accelerators in these workloads with the same Stable Diffusion 3 8B model.
Lastly, there are results for the second model, Stable Beluga 2.5 70B, a fine-tuned version of LLaMA 2 70B. With no extra optimizations and running under PyTorch, the 256 Intel Gaudi 2 AI accelerators achieved
Intel's next-generation Battlemage "Xe2-HPG" GPUs have potentially been spotted within the SiSoftware Sandra database.
Microsoft has proposed a new method to make ray tracing in games faster with future DXR API updates by leveraging SSDs to limit VRAM usage.
Intel has released its latest Game On driver, 5379 WHQL, which brings another round of huge FPS uplifts for Arc A-Series GPUs & Core Ultra CPUs.
NVIDIA has updated its ray tracing global illumination SDK to the latest feature set in the RTXGI 2.0 update, offering support for new technologies.
AMD has published the first demo of its Radeon RX 7900 XTX "RDNA 3" GPU handling Work Graphs, providing much faster and more efficient rendering.
China recently saw the launch of the KX-7000 CPU family from Zhaoxin which is aimed at desktop PCs for the domestic market segment. These chips have now seen their first benchmarks appear in the Geekbench database.
It looks like Non-Binary LPDDR5X memory is just around the corner as Honor has listed its Intel Core Ultra laptop with 24 GB RAM.
Intel has finally launched what it's calling the world's fastest desktop CPU, the Core i9-14900KS, for $699 US and packing a 6.2 GHz clock.
AMD's Ryzen AI CPUs outshine Intel's Core Ultra chips in new AI benchmarks which showcase LLMs & GenAI workloads.
Intel shared a few more updates on its AI strategy and accelerators including next-gen Gaudi 3 and Falcon Shores which reveal how the company is bringing AI to the enterprise and all aspects of the data center segment with its products and software stack.
Q4 2023 was strong for Intel as the chipmaker shipped 50 million CPUs in the desktop & notebook space, far exceeding AMD & Apple.