NVIDIA's Blackwell AI GPU lineup will include two major accelerators, the B100 for 2024 and the B200 for 2025, as revealed by Dell.
13.02.2024 - 14:46 / wccftech.com / Hassan Mujtaba / Ai
Expanding its AI ecosystem, NVIDIA has introduced "Chat with RTX", a chatbot for Windows PCs that is powered by TensorRT-LLM & available for free on the latest RTX GPUs.
The utility of the "Chat with RTX" chatbot is very simple, it is designed as a localized system which means that you will have a personalized GPT chatbot available to you all the time on your PC without the need to go online. Chat with RTX can be fully personalized by utilizing a dataset that is available locally on your PC and the best part is that it runs across almost all RTX 40 & RTX 30 GPUs.
Starting with the details, Chat with RTX leverages NVIDIA's TensorRT-LLM & Retrieval Augmented Generated (RAG) software which was announced for Windows PCs last year & takes full advantage of the RTX acceleration available on RTX hardware to deliver the best possible experience to users. Once again, the application is supported across all GeForce RTX 30 & 40 GPUs with at least 8 GB of video memory.
After downloading "Chat with RTX" for free, users can connect it to a local dataset available on the PC (.txt, .pdf, .doc, .docx, .xml) and connect it to a large language model such as Mistral and Llama 2. You can also add specific URLs for example for YouTube videos or entire playlists to further enhance the dataset search results. After connecting, users can then use Chat With RTX the same way as they would use ChatGPT by running different queries but the results generated will be based entirely on the specific dataset, giving you better responses compared to online methods.
https://cdn.wccftech.com/wp-content/uploads/2024/02/chat-with-rtx-demo-looping-video.mp4Having an NVIDIA RTX GPU that supports TensorRT-LLM means that you will have all your data and projects available locally rather than saving them in the cloud. This would save time & deliver more precise results. RAG or Retrieval Augamanted Generation is one of the techniques used in making AI results faster by using a localized library that can be filled with the dataset you want the LLM to go through & then leverage the language understating capabilities of that LLM to provide you with accurate results.
NVIDIA states a 5x performance boost with TensorRT-LLM v0.6.0 which will be available later this month. Furthermore, it will also enable support for additional LLMs such as Mistral 7B & Nemotron 3 8B.
You can download NVIDIA's "Chat with RTX" application here. It is supported by both Windows 11 & Windows 10 PCs and requires the latest NVIDIA GPU drivers for optimal
NVIDIA's Blackwell AI GPU lineup will include two major accelerators, the B100 for 2024 and the B200 for 2025, as revealed by Dell.
Intel's software team has delivered another performance-tuned driver update for the Intel Arc GPUs featured on Core Ultra chips, offering a massive boost in performance in several games.
Arc A-Series GPUs are now supported within the Intel Extension for PyTorch (IPEX), offering faster AI capabilities in deep learning & LLMs.
Do you get anxious about what to write while sending an email or have so many in your inbox that it becomes a nasty and never-ending chore? If yes, then you are not the only one as many are in the same boat. Drafting a formal and informative email could take up a lot of time and there is a negative impact of that - it distracts teams from their important work. While replying to emails is also necessary, crafting a grammatically correct and formal draft could be tricky for many. To solve this problem, we have found a tool called “Superhuman” which is powered by artificial intelligence (AI) and is designed for Gmail and Outlook users. The tool simplifies sending emails and it does so quickly and easily. Know more about the Superhuman email tool here.
Every week, a new GTA 6 leak surfaces revealing exciting details about it. The next Grand Theft Auto game, which is confirmed to be released in 2025, will likely debut with groundbreaking features that take the gameplay experience to the next level. With the GTA 6 trailer, we've already seen the likely open-world, characters, locations and some gameplay elements. One of the more interesting leaks suggests at the possibility of smarter NPCs in GTA 6 which harness the power of artificial intelligence (AI).
Intel Clearwater Forest Xeon CPUs will be making use of Foveros Direct technology to 3D Stack up to 288 cores on top of the base tile, says Bionic_Squash.
A French retailer has listed Intel's upcoming Core i9-14900KS CPU, which will feature a clock speed of up to 6.2 GHz.
Presentations are now a permanent feature in all corporate settings and they have even infiltrated into areas that were once considered anathema for them - visual elements, art and designs. Do you know why making a visually pleasing presentation is necessary? An informative piece of documentation should not always include texts and graphs as it makes the presentation look boring. To make them engaging one must include different types of visual elements such as pictures, videos, animations, icons, illustrations and more. This enables the speaker to draw more attention to the details and keep everyone hooked throughout the presentation. While there are multiple presentation tools available in the market, there is one tool called “Visme” that shines in providing the best presentation editing tools and techniques. Know more about the Visme presentation tool here.
In the fast-paced world of business, leaders often find themselves buried under a mountain of meeting notes and action items, struggling to maintain focus on strategic priorities. Broadcast, an innovative communication tool, aims to change that narrative by seamlessly integrating meeting notes with existing workflows, automating mundane tasks, and fostering productivity among engineering, product, and project management teams.
NVIDIA has unveiled its Eos supercomputer, which is a high-performing data-center-scale system targeted towards AI applications.
Individuals, particularly modders, have found a "workaround" to the ongoing AI GPU shortages, which involves tuning consumers' GPUs into a "beefy" accelerator and it looks like the old NVIDIA GeForce RTX 2080 Ti GPUs have been given a second life thanks to this new modding solution.
Google has recently issued a privacy update for Gemini AI, cautioning users about what they share with the app as well as the retention of conversations and related data. Notably, it will retain user data for up to three years, even if it has been deleted. The Gemini Apps Privacy Hub outlines the specifics of this policy, emphasising the separation of reviewed or annotated conversations and their detachment from Google Accounts.