Fine tuning my first open LLM model
Disclaimer: This is a draft blogpost. I am dumping links, quotes and…
Disclaimer: This is a draft blogpost. I am dumping links, quotes and…

I was poking around LM Studio yesterday, sorted by most downloads, and…

In this article, I'm going to document settting up nanobot on my…
AppleCare is insurance, no different than car insurance. The purpose of AppleCare…
Install Coder Desktop. Approve any system permissions, login, and enable "Coder Connect"…
AI introduced a new rhythm in which workers managed several active threads…
Funny and relatable, at least for me: Source: X.com

How much RAM do you need to run a 30 billion parameter model? Why are there multiple versions of the same model at different file sizes? What does "8-bit quantization" actually mean, and how does it affect performance and/or precision? If you're running language models locally or planning to, understanding the relationship between parameters, weights, quantization, and memory is essential.