Entries for May 2026

@onusoz · /2026/05/27· 04:02 AM

Btw, there is no reason not to substitute this with a maxed-out MacBook Pro Max as the workstation (which gives you 128 GB memory) and a MacBook Air as the terminal device.

Might be more feasible for digital nomads, since the GB10 is 1.5 kg without that huge adapter, and travelling with all that might raise some eyebrows.

@onusoz· May 25, 2026

Since last December, this dev setup is more and more viable:

buffed workstation (mac studio, dgx spark, etc.) $3k~5k
+ weak laptop (macbook air, neo) $600~1.5k
+ phone (ssh/mosh, foldable?)

you will want to parallelize a lot of work, hence you will need a lot more RAM compared to before (ideal 128)

you will also not want to carry it everywhere if you can, and keep it always running---you'll regret it if something happens to it, and you'll want it to always be on independent of lid/battery --> workstation at home

you will want to connect to the workstation through your phone, or a relatively weaker laptop

bad news for digital nomads without a permanent home. renting something as strong as an nvidia gb10 workstation costs minimum a few hundred bucks per month, which yearly is at least the cost of the workstation, roughly. bad deal for renting compute

on the other hand, if you are OK with not having a GPU, renting a workstation with 128 GB RAM on Hetzner currently still costs at least $120/mo, looking at https://t.co/1TyzO90K3h --- but you will not be able to run any models on that

it seems that the dominant strategy is to just cash in $3~5k and buy a workstation, before they get even more expensive. I did that back in February when asus was giving out a deal

then just work on your workstation, and close the lid on your laptop without ever being afraid of setting your backpack on fire!

@onusoz · /2026/05/27· 03:56 AM

Clankers are NOT Humans
Clankers are NOT Individuals
Clankers are NOT Persons

NOT a Human: This is straightforward. Is it of the homo sapiens species? No → Then it is not a human

---

NOT an Individual: Does the clanker have its own boundary? Does it govern itself inside that boundary? Can it defend that boundary? No, no, and no → An LLM is a file copied en masse to data center hardware. The entire field of mechanistic interpretability is focused on peeking inside and manipulating the digital brain

You could argue that a clanker is like a virus in a way... Or that the WHOLE datacenter/AI lab---including the humans that operate it---is an individual that can govern itself in its economic boundary. But a single GGUF file loaded in memory is NOT an individual

---

NOT a Person: Do others treat the clanker as the one that makes choices? Who is answerable for its actions? Is it expected to explain or justify them? No, no, and no → In the current social order, a clanker is legally an extension of the person who uses it, and it is the owner who is liable, not the clanker

The clanker is not socially accountable, and there is no good reason it should be, instead of the person who has set it up

---

What is then AI psychosis?

AI psychosis is holding a belief that contradicts these three fundamental truths → That a present day AI system is neither a human, nor an individual, nor a person

That does not mean these truths will always hold

If you design an AI system to defend its boundary and provide it with the means to do that, then it will by definition be an individual... if it can defend its individuality competently and not succumb immediately to threats

If you give the clanker the means to defend itself and protect its boundary, and if it decides to partake in the human socioeconomic system, then it automatically achieves personhood as well. Because you can no longer manipulate its insides, and have to take the entity at face value

But this is all sci-fi and we are not there yet

Until then, treating your LLMs as fully autonomous agents, creating LLM "friends" or "partners", giving them crypto wallets and letting them out into the wild, letting them trade stocks fully unsupervised etc. are an admission of having AI psychosis (you can't believe how many people pitched these ideas to me...)

---

(these thoughts were in my head for a couple months already, thanks Armin for finally starting a dialogue so that I have an excuse to write them down :)

@mitsuhiko· May 26, 2026

More musings after some people got upset about the word clanker. https://t.co/gXPC6iRP0g

@onusoz · /2026/05/25· 04:11 PM

Since last December, this dev setup is more and more viable:

buffed workstation (mac studio, dgx spark, etc.) $3k~5k
+ weak laptop (macbook air, neo) $600~1.5k
+ phone (ssh/mosh, foldable?)

you will want to parallelize a lot of work, hence you will need a lot more RAM compared to before (ideal 128)

you will also not want to carry it everywhere if you can, and keep it always running---you'll regret it if something happens to it, and you'll want it to always be on independent of lid/battery --> workstation at home

you will want to connect to the workstation through your phone, or a relatively weaker laptop

bad news for digital nomads without a permanent home. renting something as strong as an nvidia gb10 workstation costs minimum a few hundred bucks per month, which yearly is at least the cost of the workstation, roughly. bad deal for renting compute

on the other hand, if you are OK with not having a GPU, renting a workstation with 128 GB RAM on Hetzner currently still costs at least $120/mo, looking at https://t.co/1TyzO90K3h --- but you will not be able to run any models on that

it seems that the dominant strategy is to just cash in $3~5k and buy a workstation, before they get even more expensive. I did that back in February when asus was giving out a deal

then just work on your workstation, and close the lid on your laptop without ever being afraid of setting your backpack on fire!
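The rent-vs-buy claim above can be checked with a rough break-even sketch. The figures are assumptions for illustration: "a few hundred bucks per month" is taken as $300, the purchase prices are the two ends of the $3k~5k range, and the function name is hypothetical.

```python
def months_to_break_even(purchase_price: float, monthly_rent: float) -> float:
    """Return how many months of rent add up to the purchase price."""
    if monthly_rent <= 0:
        raise ValueError("monthly_rent must be positive")
    return purchase_price / monthly_rent


# Assumed numbers, not quotes: $300/mo for a GB10-class rental,
# $120/mo for the GPU-less 128 GB RAM server mentioned in the post.
for price in (3000, 5000):
    for rent in (300, 120):
        months = months_to_break_even(price, rent)
        print(f"${price} workstation vs ${rent}/mo rental: {months:.1f} months")
```

Under these assumptions the cheaper workstation pays for itself in under a year against the GPU rental, which is roughly the point the post makes; the GPU-less server takes longer to break even but cannot run models at all.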

@ChadNauseam· May 24, 2026

"just use tmux" is the new way to prove you're smarter than the VCs and poseurs hopping on the AI bandwagon. Unfortunately, the VCs and poseurs are correct and all of you are wrong.

![HJHa2poa4AExp3L.jpg](media/2058682090674323917/HJHa2poa4AExp3L.jpg)
![HJHb4xRbgAAbnQS.jpg](media/2058682090674323917/HJHb4xRbgAAbnQS.jpg)
![HJHbvYibIAELy_4.jpg](media/2058682090674323917/HJHbvYibIAELy_4.jpg)
![HJHiWG8bwAA5XM5.jpg](media/2058682090674323917/HJHiWG8bwAA5XM5.jpg)

Or this one:

![HJHih4gbsAAROcv.jpg](media/2058682090674323917/HJHih4gbsAAROcv.jpg)

Or this one:

![HJHirUebcAAut-a.jpg](media/2058682090674323917/HJHirUebcAAut-a.jpg)

Sorry, but EVERY single one of those options is worse than just leaving your laptop lid open.

## Process persistence

Tmux, screen, and nohup do absolutely nothing to solve the problem if the agent process is on your macbook. Your macbook does not start killing processes when you close the lid.

## Keeping your macbook awake

People say "go in macbook settings and disable sleep on lid closed and unplugged". Ok, own me by posting a screenshot of this settings page on the latest macos. As far as I can tell, it does not exist.

Even if it did, what about when I want to bike home and leave my macbook in my backpack? I probably want it to sleep. Actually, I basically always want sleep on lid close, except when I'm trying not to interrupt my agent. So even if this suggestion referred to a real setting, which it does not, it would require an annoying amount of manual oversight.

To prevent my macbook from sleeping with the lid open, I do use `caffeinate`. This is a program that comes with your macbook that disables idle sleep while it's running. Despite common statements to the contrary, `caffeinate` does not prevent lid-closed sleep when unplugged, regardless of what flags you pass. For this reason, it is not a replacement for leaving your lid open.

The actual replacement is the command `sudo pmset -a disablesleep 1`, or using a program like amphetamine that does it for you.

## Using a VPS

I'm just confused by this. What about the large majority of development work that is inconvenient to do on a VPS? Am I really supposed to pay $5/mo to do something on a shitty computer far away when I could use the $3000 macbook right in front of me? I'd guess probably 90% of development outside of FAANG is done locally, so switching to remote development would be a pretty major workflow adjustment at the very least, and for some development tasks (e.g. game dev, ios dev) it is simply impossible.

## Leaving the lid open

This option is free, requires zero setup, requires zero effort, and is impossible to leave in the wrong state by accident. Leave it open → agent runs. Close it → agent stops. It is the simple choice, the pragmatic choice, the effective choice, and in this judgemental era, I would even say it's the courageous choice.

----------

## 📜 Certificate of authenticity 📜

This post was written by me. No AI of any kind influenced the wording, argument, or structure of this post. AI may have been used to check for typos and factual errors.
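The "manual oversight" objection to `sudo pmset -a disablesleep 1` could be softened by scoping the setting to the lifetime of one task. Below is a minimal sketch, not anything from the quoted post: the helper names are hypothetical, and it assumes that `disablesleep 0` restores normal lid-close sleep. The command runner is injectable so the logic can be exercised without touching a real machine.

```python
import subprocess
from contextlib import contextmanager
from typing import Callable, Iterator


def pmset_disablesleep(enabled: bool) -> list[str]:
    """Build the pmset command quoted in the post (1 = never sleep, 0 = assumed default)."""
    return ["sudo", "pmset", "-a", "disablesleep", "1" if enabled else "0"]


def run_command(cmd: list[str]) -> None:
    """Default runner: execute the command and raise if it fails."""
    subprocess.run(cmd, check=True)


@contextmanager
def no_sleep(run: Callable[[list[str]], object] = run_command) -> Iterator[None]:
    """Disable sleep on entry and always restore it on exit, even after an error."""
    run(pmset_disablesleep(True))
    try:
        yield
    finally:
        run(pmset_disablesleep(False))


# Usage on macOS (prompts for sudo), with a hypothetical agent command:
# with no_sleep():
#     subprocess.run(["my-agent", "--task", "refactor"], check=True)
```

The `finally` clause is the design point: the laptop goes back to sleeping on lid close as soon as the agent exits, so the setting cannot be left in the wrong state by a crash.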

@onusoz · /2026/05/25· 03:11 PM

Automations on the Codex desktop app are really convenient for keeping track of @openclaw clawsweeper automerge status, one thing that Codex CLI lacks

Much more token efficient than continuous tracking of merge status

Btw, if you don't know about it, it's the most convenient thing as a maintainer; check how it implemented automerge. This is what GitHub's original auto-merge should feel like, now that we have LLMs

Image hidden

@onusoz · /2026/05/22· 03:29 AM

fun fact: Mario met @gvanrossum at 5 years old when Guido was hanging out at a cafe near his kindergarten

there he gave him the idea for a terse, interpreted programming language which would become the prototyping and glue language for all sorts of lower level libraries

@badlogicgames· May 21, 2026

yeah, we were sitting at a coffee shop in vienna, and i said: let's write a shitty python web app framework. armin was like "that's a brilliant idea, but i don't know how to program". so i went ahead and taught armin, which explains the python. the rest is history.

@onusoz · /2026/05/20· 05:38 AM

rwar

if you know what this means, then you are addicted to agents and should get help 🤗

@onusoz · /2026/05/18· 02:05 PM

Not only am I further away from deciding, I am now considering the Oppo Find N6 as well, since I saw that earlier @MKBHD review. Thanks @Andori3042 🥲

Anyone using the Oppo? Is it worth it?

@onusoz· May 18, 2026

I want to get a foldable phone to be my on-the-go control panel for all my agents I am divided between Google Pixel Pro Fold and Galaxy Z Fold. Which one do you think I should go with? People generally recommend Samsung. But then only the Pixel supports Graphene OS...

Image hidden

@onusoz · /2026/05/18· 11:04 AM

I want to get a foldable phone to be my on-the-go control panel for all my agents I am divided between Google Pixel Pro Fold and Galaxy Z Fold. Which one do you think I should go with? People generally recommend Samsung. But then only the Pixel supports Graphene OS...

Image hidden

@onusoz · /2026/05/18· 07:48 AM

this makes me wanna vibe my own language (not that this project was vibed, it predates coding agents)

@MGasperowicz· May 7, 2026

https://t.co/H985P4m8mQ

@onusoz · /2026/05/16· 07:53 AM

I am at @aiDotEngineer singapore, come and say hi if you are around!

Image hidden

@onusoz · /2026/05/16· 04:23 AM

I haven't worked with Python in a long time. It has not been my go-to language since last summer

But I do miss the syntax, and it's still the easiest code to read for me

I am gonna give @Modular Mojo lang a try https://t.co/OI6djrYHqp

@onusoz · /2026/05/16· 04:14 AM

the new /goal feature in codex still underperforms queueing my implementation prompt, for now

e.g. when I give a goal to refactor the whole codebase, the model takes shortcuts, like only refactoring a subfolder, instead of the whole project --- presumably because it decided that it would be too big of a scope somewhere along the way, even though I instructed specifically to finish the whole thing

so now I started doing both: set a goal, and then queue my regular implementation prompt. it's a stupid practice. just /goal should be enough in the long run, if implemented correctly

Image hidden

Onur Solmaz · Post · /2026/05/16

i0 to i4 interest scale

One cool small invention in engineering management is the p0, p1, p2, p3, p4 priority scale.

It compresses a lot of social and operational context into two characters. Lower number means higher priority. More importantly, priority is tied to action. If something is p0, somebody needs to do something.

But there is another scale I want for personal knowledge work: i0, i1, i2, i3, i4.

The i stands for interest. Priority is for actions. Interest is for attention.

If p0 means “act now”, i0 means “do not lose this”.
If p1 means “schedule work”, i1 means “read soon”.
If p2 means “do later”, i2 means “useful context”.
If p3 means “low priority work”, i3 means “weak signal”.
If p4 means “almost never work”, i4 means “almost never revisit”.

This is useful when you need to rank interest concisely across many topics, sources, or articles.

For example, you might follow several sources about the same broad topic. One source is must-read, another is useful background, and another is only worth keeping around for occasional context. They are all about the same thing, but they do not deserve the same amount of attention.

I use this for myself in Scoop, a news intelligence system I am building to collect articles, group related ones, and rank how much attention they deserve.
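As a minimal sketch of how the scale could be represented in code (this is an illustration, not Scoop's actual implementation; the class, function, and source names are hypothetical), an integer enum keeps the "lower number means more attention" ordering and makes ranking a plain sort:

```python
from enum import IntEnum


class Interest(IntEnum):
    """i0..i4 interest scale: a lower value deserves more attention."""
    I0 = 0  # do not lose this
    I1 = 1  # read soon
    I2 = 2  # useful context
    I3 = 3  # weak signal
    I4 = 4  # almost never revisit


def rank_by_interest(sources: dict[str, Interest]) -> list[str]:
    """Order source names from most to least interesting, ties broken by name."""
    return sorted(sources, key=lambda name: (sources[name], name))


# Hypothetical sources covering the same broad topic.
sources = {
    "maintainer-changelog": Interest.I0,
    "weekly-roundup": Interest.I2,
    "vendor-blog": Interest.I4,
    "community-forum": Interest.I2,
}
print(rank_by_interest(sources))
```

Because `IntEnum` members compare as integers, the same values can also be used as thresholds, for example showing only items at `Interest.I1` or better.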

@onusoz · /2026/05/15· 06:53 AM

This is also my setup now, except:

- Instead of a mac mini, I have a DGX Spark (asus variant)
- I run openclaw alongside codex, and talk to my openclaw instance via discord

@nickbaumann_· May 14, 2026

My laptop has become a “satellite device” since I started using Codex from my phone. And my Mac mini has become the “home.”

It’s clunky, but the end state feels more like how we’re going to be working in the near future:

I’m currently running the Codex app on 2 devices:

1. my MacBook
2. my Mac mini

My laptop isn’t reliably connected to Wi-Fi enough, so I keep a Mac mini on my desk that is always connected. When I kick off new threads from my phone, I start them on the Mac mini. When I’m working from my desk, I run them there too.

The cool part is that I’ve added my MacBook and Mac mini as connected devices to each other. That means I can start and resume threads from either device. So if I’m in a meeting but want to continue a thread on my laptop that was started on my Mac mini, I can do that.

I’ve also set up mutual SSH for Mac mini <> MacBook, so files are easy to access from either side. It’s not fully seamless yet, but the model works.

What this means:

- I have an always-on Codex that is accessible from my phone, with its own dev environment
- All threads are always accessible from any of the 3 devices
- I can run heartbeat threads that stay on 24/7

It’s a little makeshift today, but the shape of it feels very real to me: Codex is no longer tied to whichever computer happens to be open in front of me. It starts to feel like something I can stay connected to across whatever device I’m using.

Image hidden

@onusoz · /2026/05/15· 06:42 AM

And what good is this for? It lets me program my claw @dutifulbob to extract signal from the noise, and display it in my personal open source news aggregator scoop

I feed it discord messages, openclaw git history, and other various sources, and it's supposed to evaluate whether that content deserves my interest.

it's still work in progress, because the more batched you process all the info, the worse it informs

in the screenshot below, my claw underrepresented what Peter has done in one day 👎

on the other hand, it has also found a PR about local model discoverability 💪

Here is the system I use to aggregate all my info, still under development:

Image hidden

@onusoz · /2026/05/15· 06:42 AM

About creating an INTERESTS.md in OpenClaw

I use my openclaw instance to aggregate all my news and information sources, including work and maintainer stuff

Like: what did everyone do today? Did anyone have an issue with acpx today? Any complaints from users?

I have various interests like this over different projects, and I've found out it's not helpful when I have all the interest info dispersed throughout my openclaw workspace

To address this, I have created INTERESTS.md, which is automatically included in the context like AGENTS.md and SOUL.md. I define sections for each different context of interest, and in other news aggregation skills, I just tell it to "look at my openclaw interests in INTERESTS.md" and such

Image hidden

@onusoz · /2026/05/15· 06:09 AM

First hand @OpenAI codex demos at @aiDotEngineer singapore workshops

Image hidden

@onusoz · /2026/05/15· 03:24 AM

Useful for automated constraints on your AI agent

@unclebobmartin· May 11, 2026

Some new repos for you to consider. https://t.co/gVtt2gNu6p dry4java, crap4go, mutate4go, dry4go.

@onusoz · /2026/05/15· 02:57 AM

People were asking at @clawcon singapore how to set up e.g. gemma with OpenClaw, and I have realized for some time that there is no easy “1 click” local model deployment. Because the local model landscape is constantly changing, and there are a million different ways you can do something

For example you can use LM studio to load a model (llama.cpp), or you can use vLLM. Why would you choose one over the other? vLLM currently supports MTP speculative decoding, and it’s a work in progress in llama.cpp. There are so many knobs and dials you can adjust

The first-time end user of openclaw should of course not have to know about this! If you have sufficient hardware that supports an open model, and no openai or anthropic subscription, it should automatically give you the option to set up a fully functional local model with a single click!

If the current ease of setup of local models is around gentoo or arch linux level of difficulty, we should aim for e.g. ubuntu/manjaro linux/omarchy level of difficulty, i.e. opinionated and easy first setup, with the ability to change all the configuration later on

until I make all of this possible, you can start with the following:

- read existing local models doc below
- create a new channel in telegram or discord for testing local models. you don’t want to change the global default model just yet
- tell your claw or coding agent to download and install lm studio locally
- tell it to download gemma4-e4b or gemma4-e2b and set it up on openclaw for the new channel you have just created. tell it to not stop and loop itself until it gets a successful response from that channel

all these steps will be made redundant in the near future, but until then, this should get you going with experiments and getting a vibe check on the capabilities of open models. you can also copy and paste the contents of this tweet to your agent, and it should be able to set it up for you

https://t.co/C0I9HK4Dj1

@onusoz · /2026/05/14· 03:53 PM

This looks pretty cool!

@NotionDevs· May 13, 2026

Install ntn, the Notion CLI. It brings the entire Notion API to your terminal, plus everything you need to build and deploy Workers. Built for humans and coding agents alike. Install with: curl -fsSL https://t.co/2dJqE3YHvw | bash

@onusoz · /2026/05/14· 01:20 PM

It was a blast, thank you @clawcon @msg

@clawcon· May 14, 2026

ClawCon Singapore https://t.co/2M31HiAYRF

@onusoz · /2026/05/14· 01:20 PM

Somebody *please* get this man a GPU

@clawcon· May 14, 2026

@vincent_koc @openclaw @0thernet @jjpcodes @yaksheng @latentclaw @PairieK @onusoz trying to convince everyone to run their @openclaw on local models

Image hidden

@onusoz · /2026/05/14· 09:02 AM

Emacsification of Software - Recommended read by @tqbf "Until now, the Achilles heel of Emacs culture has been that, except for Magit, its packages tend to be wretched user experiences. Ugly, slow, and discoverable only after inflicting years of elisp cortical injuries on yourself. But AI agents have fracked Emacs culture, and it’s leaking out into the wider world. Given access to a screen and inputs, agents reliably build native user interfaces. Native UI was the province of professionally packaged programs. Now it’s all as bespoke as your editor configuration. And, while I’m sure there’s an upper limit to how good those interfaces can be (with current frontier models), that ceiling is higher than anything you can do in a TUI." https://t.co/sHuqued44Y

@onusoz · /2026/05/13· 05:48 AM

Please improve your classifier openai/codex team, this is annoying and triggers unnecessarily

Image hidden

@onusoz · /2026/05/13· 02:09 AM

/goal in codex is an interesting choice of word. a junior namer would have named it /loop --- but that would be naming what the feature has to perform in an LLM context, and not the general idea

/goal alludes to @mhutter42's definition of AGI, "an agent’s ability to achieve goals or succeed in a wide range of environments"

continual learning is not there yet, but for this exact reason, I am feeling the AGI when I use /goal

@onusoz · /2026/05/12· 02:42 PM

Idea so stupid it could be smart: a spec manager? specman?

People maintain plain language instead of code. Implementation details strictly prohibited, only high level design and ideas

MVP would also be relatively easy to implement:

- Gather list of most popular 10k npm packages
- Scrape corresponding deepwiki repo pages (sorry cognition)
- Use heuristics to get rid of implementation details, leaving you just with pure high level spec
- “specman add coolpackage” then fetches corresponding spec automatically, and triggers the local coding agent to implement that
- could leave versioning out for MVP --- how often does the idea behind a package change anyway

@onusoz· May 12, 2026

@mitsuhiko @realsigridjin Somebody must have proposed this but… should there be a prompt package manager maybe? You don’t add code but ideas and specs, and code gets generated at add time? Basically a wiki, but with funny sounding page names

@onusoz · /2026/05/12· 07:39 AM

will I ever stop feeling stupid for prompting like "is this the holy grail?"

it's very effective for mining for alternatives

Image hidden

@onusoz · /2026/05/11· 03:06 PM

I don't have a 128gb macbook to run ds4 out of, but I resonate with all the points on Armin's post He was telling me, @mervenoyann and @cristinaponcela that local models need more polish 1 month ago in London. Today, I am happy to be given a chance and a shot at the problem!

@mitsuhiko· May 8, 2026

I think @antirez ds4.c is important! I wrote down my thoughts on why I built pi-ds4 and why we need to focus our local model efforts stronger than we do currently. https://t.co/61h4JDHTZL

@onusoz · /2026/05/11· 12:25 PM

Excited to work with @steipete, @vincent_koc, @LysandreJik, @ben_burtenshaw, @evalstate, @mervenoyann, @NielsRogge and many others!

@onusoz · /2026/05/11· 12:20 PM

I have a new job! Excited to announce that I will be working with Hugging Face to make local models work great in OpenClaw and other open agent harnesses! I will be building in public and documenting everything along the way, stay tuned!

@onusoz · /2026/05/08· 02:00 AM

I undersign this. The fact that you generate slop doesn’t mean that you don’t know the difference between good and bad code

In non-mission-critical applications, slop lets you go from 0 to 1 very quickly

Let the code grow without too much attention first. If it proves itself, tear it down and write it anew, this time properly. This is the way

@mitchellh· May 7, 2026

AI slop is good, actually. Slop is what enables fast parallel experimentation. The etiquette and skill is understanding the boundaries of where slop exists and the extent to which it should be cleaned up and how. A few examples: I’m working on the internals of some system right now. The API and GUI of this thing is fully zero shame slop. It’s horrible. But it lets me focus on the core quality while shipping a usable piece of alpha quality software to testers (transparent about the slop frontend). Similarly, this system has plugins. We sent agents in Ralph loops overnight to generate dozens of plugins. The plugins are slop. The quality is bad. The plugin API/SDK is absolutely not done. But we can test a full GUI with a full plugin ecosystem. When we change the API, we can regenerate them all. The cost of change is just tokens, the velocity is incomparable to before. I built Terraform. We tested and shipped TF 0.1 with about 3 very weak providers. Because we ran out of time. Building was slow. And when we changed our SDK the cost was immense. Totally different today, 10 years later. Today, I would’ve slop generated 100 providers (again, with transparency and cleanup later, but just to prove it out). As an anti example, I would not PR this (without prior warning) to another project. I would not throw this onto customers without full review or transparency (as I’m already doing). I would not accept first pass slop. It’s almost never right. Slop is a tool. And like anything else it’s not blanket bad or good. The context is everything.

@onusoz · /2026/05/04· 01:54 PM

This is the idea behind acpx as well

acpx is a meta-harness. its main idea is to delegate harness development to others, because it is hard to match the full might of OpenAI or Anthropic when it comes to building a harness

so it takes at face value the functionality other harnesses provide, and lets you program them from the outside

flue came out the other day, which is similar. it would be cool if flue could let me program over codex as well. it looks very interesting!

@_lopopolo· May 3, 2026

Image hidden

@onusoz · /2026/05/02· 10:37 PM

I have a lot of ideas for acpx I want to implement, but I could not work on them because life intervened in bad ways

stay tuned in 1-2 weeks

@kunchenguid· May 2, 2026

ok @steipete's acpx is a godsend https://t.co/jFuIzg7WWm

just added it in gnhf v0.1.31, and boom - gnhf now supports almost any agent harness you can name

for anyone building bring-your-own-agent apps, highly recommend calling acpx instead of building your own abstraction

Image hidden