Independent software consulting for cloud-native Go applications and applied AI & LLMs:
One of the first things you learn when going down the path of building applications with LLMs is that you need evals. Evals are basically like traditional software tests, but for interactions with the LLMs. Because LLMs are not deterministic, this is typically done by assigning scores between 0 and 1 to results, based on some scoring mechanism. Various platforms for storing and displaying eval results exist already, but wouldn’t it be nice if they could live in your existing test system, and be run and displayed in CI, just like your tests? I’ve explored just that, and in this post, I’ll show you what I found. It involves a SQLite database for tracking results over time, PR comments with Markdown tables, and a lot of Github Actions workflow YAML.
Read the rest of “Running LLM evals right next to your code”
Like the title says: I’m all-in on AI and LLM-based applications. But what does that mean exactly, for a traditional software engineer like myself? Actually, not that much. LLMs are another (very) useful tool for building applications, with their own set of opportunities and challenges.
Read the rest of “I’m all-in on AI & LLMs (but it’s also just another tool)”
You know what’s both intriguing and challenging about the developer consulting world? It’s not just about writing code, it’s about building a whole persona, finding the right clients, while somehow simultaneously also making sense of the rapidly evolving tech landscape. I just had a nice and interesting chat yesterday with another indie consultant from the prompt engineering and LLM space, John Berryman, that got me thinking about this whole journey.
Read the rest of “The art of being an indie software consultant: a chat with John Berryman”
Anthropic unveiled the Model Context Protocol yesterday, a protocol to let LLMs gather context and use tools on servers that use the protocol. Here's a video demo of me using it with the Claude Mac desktop app and the SQLite server.
Read the rest of “Demo: Trying the Model Context Protocol with a SQLite server and Claude”
I’m currently in the market for an LLM eval system / project / pipeline I can use in my Go- and LLM-powered applications. This is a post to gather my thoughts and expose them to the wide internet. Think of it like a stream-of-thought post that might give you some ideas and food for thought as well.
Read the rest of “Requirements for an LLM eval pipeline (in Go?)”
I really like boring technology. Software that has been tested for decades, does what it is supposed to robustly and fast, and is known by every person and machine on the planet. Technology like relational databases and SQL, HTTP, HTML, a well-known programming language. Throw LLMs into that mix, and you've got a potent cocktail!
Like the rest of the world, I’ve embraced chatting to large language models (LLMs) as part of my professional and personal life. I rarely use their output directly, but they help me think and brainstorm, give me ideas, and force me to write out my own thoughts, clarifying and refining them in the process (much like writing articles like this one does, by the way). These thoughts-in-writing are important to me, and that’s why I keep a history of them on my own machine. A conversational diary, so to speak. And so should you.
Read the rest of “I own my LLM chat history, and so should you”
Llamafiles are these cool little files that have llama.cpp and model weights embedded, and can run on Mac/Linux/Windows. Cool, let’s make some!
Read the rest of “TIL: Building llamafiles from Llama 3.2 GGUFs”
As software developers, we are uniquely positioned to create something and give it a life of its own, and then barely having to support it anymore. We can sit in our living rooms in our fancy fluffy pink loungewear (what, you don’t have that?!) and create products that touch and empower thousands of people. So why are so many of us content to clock in and sell our time, 19th century factory style?
Read the rest of “Decoupling time spent from value provided as a software developer”
I want to have my Go modules under my own domain, so instead of importing github.com/maragudk/example, I can import maragu.dev/example. Turns out it’s pretty easy.
Read the rest of “TIL: HTTP middleware for custom Go module paths”
It’s time for the first product! I want to build a web app to help you remember your friends’ birthdays and their kids’ names. Basically, a simple CRM for your personal life. I call it MyFavPeople.
Learn how to download, quantize, and use Llama 3.1 with llama.cpp on a Mac. With word explanations!
Read the rest of “TIL: Quantize and use Llama 3.1 with llama.cpp on a Mac”
I’m going to let you in on a secret, if you promise to not tell anyone. It’s about making money and involves spaghetti. Are you ready?
Always choose the right tool for the job? Nah. I use Go basically everywhere, which either makes me insightful or stupid. Decide for yourself! :D
Read the rest of “Go is my hammer, and everything is a nail”