NVIDIA’s “Chat With RTX” Is A Localized AI Chatbot For Windows PCs Powered By TensorRT-LLM & Available For Free Across All RTX 30 & 40 GPUs

13.02.2024 - 14:46 / wccftech.com / Hassan Mujtaba / Ai

Expanding its AI ecosystem, NVIDIA has introduced "Chat with RTX", a chatbot for Windows PCs that is powered by TensorRT-LLM & available for free on the latest RTX GPUs.

NVIDIA Wants To Replace ChatGPT With Its Own Locally-Available "Chat With RTX" AI Chatbot That's Available For Free On RTX 30 & 40 GPUs

The utility of the "Chat with RTX" chatbot is very simple, it is designed as a localized system which means that you will have a personalized GPT chatbot available to you all the time on your PC without the need to go online. Chat with RTX can be fully personalized by utilizing a dataset that is available locally on your PC and the best part is that it runs across almost all RTX 40 & RTX 30 GPUs.

Related Story NVIDIA Back On Earth After Briefly Beating Alphabet, Amazon On Stock Market

Starting with the details, Chat with RTX leverages NVIDIA's TensorRT-LLM & Retrieval Augmented Generated (RAG) software which was announced for Windows PCs last year & takes full advantage of the RTX acceleration available on RTX hardware to deliver the best possible experience to users. Once again, the application is supported across all GeForce RTX 30 & 40 GPUs with at least 8 GB of video memory.

After downloading "Chat with RTX" for free, users can connect it to a local dataset available on the PC (.txt, .pdf, .doc, .docx, .xml) and connect it to a large language model such as Mistral and Llama 2. You can also add specific URLs for example for YouTube videos or entire playlists to further enhance the dataset search results. After connecting, users can then use Chat With RTX the same way as they would use ChatGPT by running different queries but the results generated will be based entirely on the specific dataset, giving you better responses compared to online methods.

https://cdn.wccftech.com/wp-content/uploads/2024/02/chat-with-rtx-demo-looping-video.mp4

Having an NVIDIA RTX GPU that supports TensorRT-LLM means that you will have all your data and projects available locally rather than saving them in the cloud. This would save time & deliver more precise results. RAG or Retrieval Augamanted Generation is one of the techniques used in making AI results faster by using a localized library that can be filled with the dataset you want the LLM to go through & then leverage the language understating capabilities of that LLM to provide you with accurate results.

NVIDIA states a 5x performance boost with TensorRT-LLM v0.6.0 which will be available later this month. Furthermore, it will also enable support for additional LLMs such as Mistral 7B & Nemotron 3 8B.

You can download NVIDIA's "Chat with RTX" application here. It is supported by both Windows 11 & Windows 10 PCs and requires the latest NVIDIA GPU drivers for optimal

Tags: online performer Videos Provident Markets Software

See full article on wccftech.com

The website gametalkz.com is an aggregator of news from open sources. The source is indicated at the beginning and at the end of the announcement. You can send a complaint on the news if you find it unreliable.

Top Authors

Derby County

Jim Ryan

Tom Clancy

Phil Spencer

Tom Henderson

Geoff Keighley

Mat Piscatella

Playstation Plus

Sam Altman

Harley Quinn

Tom Warren

Naoki Yoshida

Ryan Dinsdale

Will Shen

Todd Howard

Peter Parker

Bandai Namco

Swen Vincke

NVIDIA Blackwell B100 GPUs Coming This Year & Upgraded B200 For 2025’s AI Data Centers, Dell Confirms

NVIDIA's Blackwell AI GPU lineup will include two major accelerators, the B100 for 2024 and the B200 for 2025, as revealed by Dell.

wccftech.com

27.02.2024 / 10:54

Intel Arc Integrated GPU For Core Ultra CPUs Receives Big Boost In Gaming Performance With Latest Drivers, Upto 155% Uplift

Intel's software team has delivered another performance-tuned driver update for the Intel Arc GPUs featured on Core Ultra chips, offering a massive boost in performance in several games.

wccftech.com

24.02.2024 / 13:38

Intel Arc GPUs Now Supported In The Intel Extension For PyTorch, Boosting AI, Deep-Learning & LLM Capabilities

Arc A-Series GPUs are now supported within the Intel Extension for PyTorch (IPEX), offering faster AI capabilities in deep learning & LLMs.

tech.hindustantimes.com

20.02.2024 / 11:15

Superhuman: This AI-powered email tool can save 4 hours for you every week, boost productivity

Do you get anxious about what to write while sending an email or have so many in your inbox that it becomes a nasty and never-ending chore? If yes, then you are not the only one as many are in the same boat. Drafting a formal and informative email could take up a lot of time and there is a negative impact of that - it distracts teams from their important work. While replying to emails is also necessary, crafting a grammatically correct and formal draft could be tricky for many. To solve this problem, we have found a tool called “Superhuman” which is powered by artificial intelligence (AI) and is designed for Gmail and Outlook users. The tool simplifies sending emails and it does so quickly and easily. Know more about the Superhuman email tool here.

tech.hindustantimes.com

20.02.2024 / 11:15

GTA 6 leak hints at AI-powered NPCs leading to smarter interactions; Know what’s coming

Every week, a new GTA 6 leak surfaces revealing exciting details about it. The next Grand Theft Auto game, which is confirmed to be released in 2025, will likely debut with groundbreaking features that take the gameplay experience to the next level. With the GTA 6 trailer, we've already seen the likely open-world, characters, locations and some gameplay elements. One of the more interesting leaks suggests at the possibility of smarter NPCs in GTA 6 which harness the power of artificial intelligence (AI).

wccftech.com

20.02.2024 / 10:38

Intel Clearwater Forest Xeon CPUs With Up To 288 E-Cores To Utilize Foveros Direct 3D Stacking Technology

Intel Clearwater Forest Xeon CPUs will be making use of Foveros Direct technology to 3D Stack up to 288 cores on top of the base tile, says Bionic_Squash.

wccftech.com

19.02.2024 / 10:23

Intel Core i9-14900KS CPU With 6.2 GHz Clocks Listed By French Retailer For €768

A French retailer has listed Intel's upcoming Core i9-14900KS CPU, which will feature a clock speed of up to 6.2 GHz.

tech.hindustantimes.com

19.02.2024 / 08:10

Visme AI-powered presentation tool: Know how to create stunning visual designs effortlessly

Presentations are now a permanent feature in all corporate settings and they have even infiltrated into areas that were once considered anathema for them - visual elements, art and designs. Do you know why making a visually pleasing presentation is necessary? An informative piece of documentation should not always include texts and graphs as it makes the presentation look boring. To make them engaging one must include different types of visual elements such as pictures, videos, animations, icons, illustrations and more. This enables the speaker to draw more attention to the details and keep everyone hooked throughout the presentation. While there are multiple presentation tools available in the market, there is one tool called “Visme” that shines in providing the best presentation editing tools and techniques. Know more about the Visme presentation tool here.

tech.hindustantimes.com

19.02.2024 / 08:10

Revolutionise your work updates with Broadcast - Your new AI-powered productivity partner

In the fast-paced world of business, leaders often find themselves buried under a mountain of meeting notes and action items, struggling to maintain focus on strategic priorities. Broadcast, an innovative communication tool, aims to change that narrative by seamlessly integrating meeting notes with existing workflows, automating mundane tasks, and fostering productivity among engineering, product, and project management teams.

wccftech.com

16.02.2024 / 16:47

NVIDIA Unveils Its Cutting-Edge Eos Supercomputer, A Technological Marvel With 18.4 Exaflops of AI Power

NVIDIA has unveiled its Eos supercomputer, which is a high-performing data-center-scale system targeted towards AI applications.

wccftech.com

15.02.2024 / 12:24

NVIDIA GeForce RTX 2080 Ti GPUs Are Being Equipped With 22 GB VRAM For AI Market, Costs $499 US Per Piece

Individuals, particularly modders, have found a "workaround" to the ongoing AI GPU shortages, which involves tuning consumers' GPUs into a "beefy" accelerator and it looks like the old NVIDIA GeForce RTX 2080 Ti GPUs have been given a second life thanks to this new modding solution.

tech.hindustantimes.com

15.02.2024 / 11:15

Sharing private information with Google Gemini AI? Beware! Know why you must not

Google has recently issued a privacy update for Gemini AI, cautioning users about what they share with the app as well as the retention of conversations and related data. Notably, it will retain user data for up to three years, even if it has been deleted. The Gemini Apps Privacy Hub outlines the specifics of this policy, emphasising the separation of reviewed or annotated conversations and their detachment from Google Accounts.

About Us

SHOW MOREHIDE

GameTalkz - ultimate gaming hub that provides in-depth gaming reviews, expertly crafted walkthroughs, and the latest updates from the gaming industry. Immerse yourself in a lively gaming community, engage in exclusive interviews with industry experts, and embark on exhilarating multiplayer adventures. GameTalkz stands as the preferred destination for gaming enthusiasts, igniting your passion and delivering an enthralling gaming journey.. The biggest video game news, rumors, previews, and other info about the PC, PS4, Xbox, Switch, & mobile titles you play. Stay tuned & well informed 24/7 with us!

Owner: SNOWLAND s.r.o.
Registration certificate 06691200
Address:
Snowland s.r.o.
16200, Na okraji 381/41, Veleslavín, 162 00 Praha 6
Czech Republic

Info