HN Tags

Jul 28, 2025 python programming llm ollama

Alright, all the cool kids are doing cool things with LLMs. Now’s my time to make a mess with them.

Yay, it’s yet another clone of Hacker News? No, not quite that.

Where does this run

I have a terror of accidentally running up some vast AWS bill, so except for the static web hosting everything is running on a tiny Intel NUC box running Debian on my home network. Plus it’s going to be a lot harder for some ne’er-do-well to hack as it just sits on my network and occasionally pushes stuff up to the AWS bucket before triggering a Cloudfront invalidation. Of course now I’ve made this public I suppose I’m just asking for someone to figure out a cunning ploy via a Hacker News comment. Anyway, this mighty beast of a server has an Intel i5-8259U CPU and 8 Gb of memory.

Although I have the decent-ish Nvidia 3090 graphics card in another machine on my home network the space-heater like characteristics of that when I boot it up are such that I don’t want it up and running 24/7. The Intel NUC turns out to be adequate to run the Qwen2.5:1.5b model under Ollama.

Note also that the “running-in-my-home-office” nature of this means that there are zero guarantees for uptime on the site; any time I want to move the box, if I’m on vacation, if the fan gets irritating, anything like that and I’ll either turn it off or reduce the frequency with which it runs.

Limitations

I think the biggest limitation is that it doesn’t really work all that well. It does do the classification of the stories, and the classifications are usually quite reasonable, but it’s hard to get it to do something consistent that I fully agree with.

In particular, my raison d’être of figuring out which are the AI stories didn’t really work out that well, because it tends to be a bit more wordy and drop things I would think of as being just “AI” into finer groups such as “AI Development”, “AI IDEs”, “LLM tooling”, and so on.

I have a few possible approaches in mind to try to squash that down to something sane, but for now I’ll just run with it - they’re not totally crazy distinctions at least.

After

The reason I was wondering about categories in the first place was that I was trying to create a plausible list of bandwagons that the IT industry has leapt aboard over the last 25 years or so. Things like LLMs are just the latest in a long line that included things like minicomputers, microprocessors, Multimedia CD ROMs, Bitcoin, NFTs, Internet of Things, and so on. That’s not to cast aspersions on these things necessarily; there’s no doubt at all that “the web” and “ecommerce” were in that list and they clearly had a lasting impact.

I thought it would be interesting to validate my plausible list by seeing what categories were highly visible to the Hacker News crowd during its reign. This will no doubt be a bit off base, with things like that Erlang incident that we don’t talk about, but it ought to give me a feel for whether I’m completely deluded with the topics in my list.

So at some point, when hntags is categorising to my satisfaction, that article will probably be based on the same logic that I’m using here.

Don’t hold your breath though. Oooh, I saw a shiny thing…

Python

So how am I doing with the Python stuff? Well, my code’s not going to win any awards, but I’m getting a better understanding of python syntax, have a bit of understanding of what the idiomatic way to do stuff is (list comprehensions are quite cool), and I’m sort of getting to grips with the packaging mechanisms. I wouldn’t want to write any commercial code just yet, but I feel like I’m getting there. uv seems quite nice for what it’s worth.

Resources

The main resource for this project is the Github repository and its README. That includes a section on Weaknesses, Fear, Uncertainty, Doubt which is essentially a stream-of-consciousness dump on where this is and where it’s going.

HN Tags

The HNTags.com Website

Categories

Where does this run

The Intel NUC (the top one)

Limitations

After

Python

Resources