How do NVIDIA’s new GeForce RTX 5090 and 5080, launched with much fanfare over their new features and capabilities, perform in real-world AI applications?

Does the size of a large language model affect the relative performance of different GPUs?
How do GPUs from NVIDIA’s professional lineup compare to one another in the llama.cpp benchmark?
How do GPUs from NVIDIA’s GeForce series compare to one another in the llama.cpp benchmark?
What should you keep in mind when getting started with running LLMs locally?
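
For readers new to running LLMs locally, the sketch below illustrates the kind of measurement behind these questions: generating tokens on a GPU and reporting throughput in tokens per second. It is a minimal example using the llama-cpp-python bindings rather than llama.cpp's own benchmarking tool, and the model path, prompt, and parameters are placeholders, not the configuration used in our testing.

```python
# Minimal sketch: timing local token generation with llama-cpp-python.
# Assumptions: llama-cpp-python is installed with GPU (CUDA) support, and
# "model.gguf" is a placeholder path to any quantized GGUF model file.
import time
from llama_cpp import Llama

llm = Llama(
    model_path="model.gguf",  # hypothetical path to a GGUF model
    n_gpu_layers=-1,          # offload all layers to the GPU
    n_ctx=2048,               # context window size
    verbose=False,
)

start = time.perf_counter()
result = llm("Write a short poem about GPUs.", max_tokens=128)
elapsed = time.perf_counter() - start

generated = result["usage"]["completion_tokens"]
print(f"{generated} tokens in {elapsed:.2f}s "
      f"({generated / elapsed:.1f} tokens/sec)")
```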