Entries for March 2026

@onusoz · /2026/03/31· 11:16 AM View on

wow

@Fried_rice· Mar 31, 2026

Claude code source code has been leaked via a map file in their npm registry! Code: https://t.co/jBiMoOzt8G

Image hidden

@onusoz · /2026/03/30· 10:43 PM View on

next up: claude agents sdk supports openai responses api 💀

@romainhuet· Mar 30, 2026

We’ve seen Claude Code users bring in Codex for code review and use GPT-5.4 for more complex tasks, so we thought: why not make that easier? Today we’re open sourcing a plugin for it! You can call Codex from Claude Code with your ChatGPT subscription. We love an open ecosystem!

@onusoz · /2026/03/30· 10:44 AM View on

Here is the spec and implementation for this flow. The mermaid diagram includes all the steps I mentioned in the post above, including a shameless AI review ralph loop, and other loops to make CI pass, resolve conflicts and so on I would recommend reading the README and TUNING.md to understand the approach here

Image hidden

@onusoz · /2026/03/30· 10:35 AM View on

acpx v0.4 ships Agentic Workflows, or as I like to call them, "Agentic Graphs"
It lets you create node-based workflows on top of ACP (Agent Client Protocol), to drive any coding agent (Codex, Claude Code, pi) through deterministic steps
This lets you automate routine, mechanical legwork like triaging incoming PRs, bugs in error reporting, and so on...
For example, OpenClaw receives 300~500 new PRs per day. A lot of them are low quality, but they still relate to real issues, so you have to address them somehow
You need to:
- extract the intent
- cluster them based on intent
- figure out if the proposed changes are legit, or whether they are slop local solutions, like trying to catch flies instead of drying out the swamp
- if the PR is too low quality or the intent is not clear, close it
- run AI review on them and address any issues that come up
- refactor them if the changes are half-baked
- resolve conflicts
- and so on...
So that when the PR is presented to the attention of the maintainer, all the routine legwork is done and the only remaining thing is the decision to (a) merge, (b) give feedback to the PR author, or (c) take over the PR work yourself
I have wanted to build this feature for a couple of months now, ever since Codex got so good. OpenAI models are now good at judging implementation quality, so I found myself repeating the same steps I wrote above over and over
I also tried putting all this in a single prompt. But I believe there are workflows that should not be a single prompt, but a sequence of prompts in the same session
That is because, like humans, LLMs are prone to PRIMING. I claim that putting all steps in the same prompt at the beginning of the context will generally give suboptimal results, compared to revealing the intention to the model step by step
Creating such a workflow also gives more OBSERVABILITY into each step that an agent is supposed to take. The agent generates JSON at the end of each step, and that structured data makes it easier to monitor thousands of agents running at the same time on a dashboard
Similar features have been introduced in e.g. n8n, langflow. But AFAIK they do not integrate ACP the way I do
I wanted to have a fresh approach, and to build an API that I can develop freely the way I want, so I created a new workflow API inside acpx
The video is from the workflow run viewer, but that is not where you build the workflow. You build it by using the acpx flow TypeScript API. See examples/pr-triage in the acpx repo
Before building that, I started from a Markdown file with a Mermaid chart of the flow I had in mind. The Markdown file acts as a spec for the flow, and I have built the workflow through trial and error. I call this process "workflow tuning"
I started working on acpx repo PRs one by one, tuning the flow, slowly scaling to more PRs. Finally, when I felt confident, I ran it in parallel over all external open PRs in the acpx repo. I believe it already saved me hours this week
My next goal, if well received, is to set this up on a cloud agent so that it can process the 300~500 PRs the OpenClaw repo receives every day, in real time, as they come in
I believe this will save all open source maintainers around the world countless hours and make it much easier to herd and absorb external contributions from everyone!
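The idea of "a sequence of prompts in the same session, with JSON at the end of each step" can be sketched in a few lines. This is not the acpx flow API (that one is TypeScript and is not reproduced here); `Session`, `FLOW`, `run_flow`, the prompts, and the "JSON object on the last line" convention are all hypothetical names and assumptions made for illustration.

```python
import json
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Session:
    """One long-lived agent session. `send` is a hypothetical transport:
    it receives the transcript so far and returns the agent's reply."""
    send: Callable[[list], str]
    transcript: list = field(default_factory=list)

    def step(self, prompt: str) -> dict:
        # Reveal only this step's instruction; earlier turns stay in context.
        self.transcript.append(prompt)
        reply = self.send(self.transcript)
        self.transcript.append(reply)
        # Assumed convention: the reply ends with one JSON object on its last line.
        return json.loads(reply.strip().splitlines()[-1])


# node name -> (prompt, router that picks the next node from the step's JSON)
FLOW = {
    "intent": ('Extract the intent of this PR. End with JSON {"clear": bool}.',
               lambda out: "judge" if out["clear"] else "close"),
    "judge": ('Is the change a real fix? End with JSON {"legit": bool}.',
              lambda out: "review" if out["legit"] else "close"),
    "review": ('Review the diff. End with JSON {"issues": int}.',
               lambda out: "done"),
    "close": ('Draft a closing comment. End with JSON {"closed": bool}.',
              lambda out: "done"),
}


def run_flow(session: Session, start: str = "intent") -> list:
    node, trace = start, []
    while node != "done":
        prompt, route = FLOW[node]
        out = session.step(prompt)
        trace.append((node, out))  # structured record per step, e.g. for a dashboard
        node = route(out)
    return trace


# Usage with a canned fake agent instead of a real one.
canned = iter([
    'Fixes a crash on empty input.\n{"clear": true}',
    'Addresses the root cause.\n{"legit": true}',
    'No blocking problems.\n{"issues": 0}',
])
trace = run_flow(Session(send=lambda transcript: next(canned)))
print([name for name, _ in trace])
```

The routing lives in plain code rather than in the prompt, so the model only ever sees the current step, and every step leaves a small JSON record that can be aggregated across many runs.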

@onusoz · /2026/03/29· 07:35 AM View on

OpenAI early 2020s: "This model is too dangerous to release publicly, the world is not ready for it 😱😱😱" OpenAI and Anthropic in 2026: "Anybody can code now for just $200 per month. Oh btw our models are also leet uber hackers which can find zeroday exploits in any software, just fyi 😉😉😉" https://t.co/cksNYAigfc

@onusoz · /2026/03/28· 03:58 PM View on

Wow even I as a frontend noob understand the significance of this Some distant memory from 15 years ago needing to measure the width/height of some text and finding out it’s not possible to do reliably in web More beautiful typography for the web!

@_chenglou· Mar 28, 2026

My dear front-end developers (and anyone who’s interested in the future of interfaces): I have crawled through depths of hell to bring you, for the foreseeable years, one of the more important foundational pieces of UI engineering (if not in implementation then certainly at least in concept): Fast, accurate and comprehensive userland text measurement algorithm in pure TypeScript, usable for laying out entire web pages without CSS, bypassing DOM measurements and reflow

@onusoz · /2026/03/28· 07:46 AM View on

There is an economic theory waiting to be uncovered here Token Leverage (TL) = Token spend / Human labor spend The higher Token Leverage a company has, the more automated and productive they are If you have TL=1, you are spending as much money on AI as your human employees The goal of a company should be to increase TL as much as possible, while keeping a positive profit margin. It will be the only way to compete You don’t need to muddy the definition with wasted tokens vs useful tokens, because a company will always be incentivized to reduce token waste in a competitive environment. By that logic, monopolies will always waste more tokens, similar to how they waste other resources Scaling TL higher to 2x, 10x, 100x will require a skilled workforce of engineers. It will be a very complex job similar to those working at the big labs. Burnout will be a defining feature of teams scaling TL Most incumbents will fail to scale their TL over 1. Some will get decimated by new entrants with TL much bigger than 1 Curious how the average TL will end up in different sectors. Whether it will stabilize at a certain value like 5.7x, or will just keep growing…
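As a worked example of the ratio defined above, here is a minimal sketch. The function name and the dollar figures are made up for illustration; only the formula itself comes from the post.

```python
def token_leverage(token_spend: float, human_labor_spend: float) -> float:
    """Token Leverage (TL) = token spend / human labor spend, same currency and period."""
    if human_labor_spend <= 0:
        raise ValueError("human labor spend must be positive")
    return token_spend / human_labor_spend


# Hypothetical monthly figures in dollars.
print(token_leverage(50_000, 200_000))   # spending a quarter of payroll on tokens
print(token_leverage(200_000, 200_000))  # TL = 1: token spend equals payroll
```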

@t_blom· Mar 27, 2026

By the end of 2026, I predict token spend will be greater than engineering salaries at early stage startups.

@onusoz · /2026/03/27· 08:29 AM View on

There is a desperate upcoming need for version controlling non-dev knowledge work. Git for non-devs. Otherwise non-devs won't be able to use agents to their full extent
Non-dev knowledge work is notoriously bad at being version controlled. You cannot UNDO edits to all MS Word, Excel or PowerPoint files in an org as easily as you can with something like git
We know that agents will be ubiquitous. We also know they make mistakes, and people will want to undo their work regularly, once they make changes to a bunch of files. Well, they can't. They also don't have pull requests, or a way to resolve conflicts after simultaneous edits
All these problems were solved by developers. We are extremely good at this
The only non-dev tool I know that could do this at scale is Notion, and that is not used by enterprise as much as MS Office. Notion also doesn't have branches, pull requests and reviews AFAIK
Markdown and git is probably not it. I wish it were. But it is too complicated for non-devs
OneDrive or other file backup systems are also not it. Are you gonna save a copy of a 100 MB ppt every time someone changes a slide??? Let's say you find a way to compress it efficiently. Will you be able to get a single pointer to a state like we can in git?
Agents need precision. Agents need consensus, they need to be able to know ground truth. They need to be able to tell what anything was at a given time. NOTHING in the current MS stack allows it
Agents won't care about your legacy systems. There will be new file formats, systems, knowledge stack, and companies who adopt them will destroy your business
If MS Office is going to die, it will do so because of this

@onusoz · /2026/03/25· 06:56 AM View on

Another one, call me stupid: “How would Google have done it?”

@onusoz· Mar 24, 2026

This is unscientific, but there are certain keywords and phrases I use a lot while using certain models like openai's. I use them a lot because they get me what I want immediately: - plainer lang - cutover - elegant and production ready - holy grail What are yours?

@onusoz · /2026/03/24· 08:19 PM View on

The MCP versus CLI argument should be reframed as the Computer vs No-computer argument
I personally get the dunk on MCP. It didn't work last year, with earlier models. Then we saw CLIs perform much better with the same models. And giving access to bash was much simpler! Training then made models better at using a shell. CLIs also have native progressive disclosure, due to the way they work
But the most important fact doesn't get said enough IMO
A key factor was that giving a CLI to a model also means you are giving it an entire COMPUTER
The action space of all commands an agent can run on bash is much, much bigger than a few MCP servers
One is a Turing machine, and the other one is basically a REST API. Of course the Turing machine is going to be more powerful, depending on what is at the other end of the API
By that logic, giving an agent access to bash over MCP versus direct access to bash should have the same level of effectiveness, with optimized prompt engineering and long term training. Because the interfaces are equivalent
So the argument is, should we give our agents access to a computer, or not?
It depends on the security requirements and the setup which the agent is supposed to run on. If you are co-hosting the agent on the same machine you are working on, then it is safer to use MCP servers, because it limits the attack surface in case of adversarial attacks
But if you are willing to give the agent its own physical computer, and willing to be mindful about the lethal trifecta and the principle of least privilege, giving it shell access is much more useful
So MCPs win in restricted/local environments, whereas CLIs/shell access win in unrestricted/remote ones
Running an agent locally and safely with shell access requires compartmentalization. This is much heavier compared to installing MCP servers locally, which don't need that. So there is a tendency to use MCP servers locally, e.g. in a work setting
Cloud agents on the other hand are more likely to ship with a computer. Because they are already isolated = no risk, and because it makes them much more useful. So cloud agents will be using both CLIs and MCP servers, whichever gets the job done!

@onusoz · /2026/03/24· 06:41 PM View on

I just registered for an .agent domain and joined the .agent community! @dutifulbob will have bob.agent if it passes :) https://t.co/lhK5MQS1sk @agentcommunity_

@onusoz · /2026/03/24· 06:20 PM View on

Sep 2021 @lexfridman podcast with Don Knuth, they also talk about OpenAI Codex (code completion model) around 33 minute mark This aged very well https://t.co/O1eTXlHTNC

@onusoz · /2026/03/24· 05:28 PM View on

Damn I’m gonna have to switch to teams if it goes like that

@upster· Mar 24, 2026

OpenClaw now has full Teams AI UX: streaming responses, AI labels, feedback with reflective learning, welcome cards, and image understanding. Built on the official Teams SDK 🦞 FYI @steipete, @BradGroux

@onusoz · /2026/03/24· 04:56 PM View on

Codex's long horizon task and instruction following has been the most life-changing AI feature recently It is unlocking the next level of automation for me. I can convert my own heuristics into prompts and multiply my throughput 100x Currently spending some thought on how to orchestrate all this. Below is a flowchart from a triage workflow I am working on

Image hidden

@onusoz · /2026/03/24· 04:22 PM View on

Amazed everyday by the unreasonable effectiveness of in-context learning

@onusoz · /2026/03/24· 02:42 PM View on

This is unscientific, but there are certain keywords and phrases I use a lot while using certain models like openai's. I use them a lot because they get me what I want immediately: - plainer lang - cutover - elegant and production ready - holy grail What are yours?

@onusoz · /2026/03/24· 12:55 PM View on

Request for memes A funny and quirky edit of historical timeline of the madness that is openclaw with "Chess type beat" or sth equally jazzy/circusy Preferably including its adventure warelay -> clawdis -> clawdbot -> moltbot -> openclaw Including: - its explosion after @4shadowed's discord integration - naming drama, moltbook and people getting oneshotted about AI takeover - @steipete speedrunning everything - andrew tate calling us gay lol - up to Jensen talking about openclaw on stage for 5 minutes straight and other things I am forgetting maybe overlaid with a lobster just keeping climbing the github star graph and breaking it

@onusoz · /2026/03/24· 08:59 AM View on

Native support for Codex on OpenClaw I will be using half my codex channels on acp and other half on codex app server for optimum dogfooding

@huntharo· Mar 23, 2026

@openclaw Codex App Server - Your bridge to using Codex in OpenClaw https://t.co/oAy8xCOt0v

@onusoz · /2026/03/22· 10:26 PM View on

I see non-engineers have a higher tendency to humanize their agents, give them personalities, and get AI psychosis It's a slippery slope. Do NOT give your agents human names or personalities, especially not of the opposite gender. it's like giving human names to pets On the other end, I realized engineers tend to do the opposite. We also refer to agents as clankers, as if to make them know their place. That's because we have mechanical sympathy and have different expectations of these manufactured products (even though they contain glimmers of human soul)

@onusoz · /2026/03/22· 08:50 PM View on

Request for testing Give this to your openclaw instance: "update yourself to the dev channel `openclaw update --channel dev` and restart yourself. if that doesn't work -> clone github openclaw/openclaw to this machine if it's not already. then rebuild and restart yourself on main branch there" Then give your openclaw a try with your regular workflows/tasks Huge openclaw release incoming tonight, hopefully (no promises). We need to make sure we break as little as possible Plugins might break, because the plugin SDK is being refactored. Plugins will have to be refactored to use the new SDK, please do not report those Do report: native openclaw functionality that stops working Please reply under this post, we'll be checking here 👇

@onusoz · /2026/03/22· 08:39 PM View on

Request for testing Give this to your openclaw instance: "update yourself to the dev channel `openclaw update --channel dev` and restart yourself" Then give your openclaw a try with your regular workflows/tasks Huge openclaw release incoming tonight, hopefully (no promises). We need to make sure we break as little as possible Plugins might break, because the plugin SDK is being refactored. Plugins will have to be refactored to use the new SDK, please do not report those Do report: native openclaw functionality that stops working Please reply under this post, we'll be checking here 👇

@onusoz · /2026/03/22· 08:03 PM View on

Request for testing Give this to your openclaw instance: "clone github openclaw/openclaw to this machine if it's not already. then rebuild and restart yourself on main branch there" Then give your openclaw a try with your regular workflows/tasks Huge openclaw release incoming tonight, hopefully (no promises). We need to make sure we break as little as possible Plugins might break, because the plugin SDK is being refactored. Plugins will have to be refactored to use the new SDK, please do not report those Do report: native openclaw functionality that stops working

@onusoz · /2026/03/22· 07:25 AM View on

My takeaway from this is academia needs good social media and algo. For me, these serendipitous interactions happen through X, here, like reading @steipete’s “Claude Code is my computer” when it first came out, finding out about clawdbot…
Terence Tao is already on mathstodon, I wonder if that worked out the same way for him. I wonder if the algo there works out as well as it does for me here
I really liked being on campus when I was doing a masters and half a phd, but that could not compare to the serendipity I am getting from X now
I was also not a prodigy that everyone wanted to bounce ideas off like Terence :)

Quoted post

Quoted post was not retrieved.

@onusoz · /2026/03/21· 08:57 AM View on

Welcome ClaudeClaw to the Claw family! Claude is a bit shy and doesn’t want to show its source code. But it’s OK, we love Claude that way :)

@sawyerhood· Mar 19, 2026

Image hidden

@onusoz · /2026/03/21· 07:22 AM View on

It is obvious to me at this point that agent infra needs to run on Kubernetes, and agents should be spawned per issue/PR
Issue, error report or PR comes into your repo -> new agent gets triggered, starts to do some preliminary work
If it's an obvious bugfix, it fixes it and creates a PR. If it's something deeper/more fundamental, it creates a report for the human and waits for further instructions
Most important thing: Human should be able to zoom in and continue the conversation with the agent any time, steer it, give additional instructions. This chat will happen over ACP
The chat UI will have to live outside of GitHub because it doesn't have such a feature yet, i.e. connect arbitrary ACP sessions to the GitHub webapp
It also cannot live so easily on Slack, Teams or Discord, because none of these support multi-agent provisioning under the same external bot connection. You are limited to 1 DM with your bot, whereas this setup requires an arbitrary number of DMs, one with each agent. So there will need to be a new app for this
Then there is the issue of conflict -> Agents will work on the same thing simultaneously (e.g. you break sth in prod and it creates multiple error reports for the same thing). You will need some agent to agent communication, so that agents can resolve code or other conflicts. There could be easy discovery mechanisms for this, e.g. detect programmatically when multiple open PRs are touching the same files and would conflict if merged
In case of duplicates, they can negotiate among each other, and one can choose to absorb its work into the other and end its session
We are so early and there is so much work to do!
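The discovery mechanism mentioned above (detecting open PRs that touch the same files) can be sketched as a set intersection. This is a minimal illustration under assumptions: `overlapping_prs` and the PR numbers are hypothetical, and how the changed-file lists are fetched is left out. Sharing a file is only a heuristic; two PRs can touch the same file without conflicting.

```python
from itertools import combinations


def overlapping_prs(files_by_pr: dict) -> list:
    """Return (pr_a, pr_b, shared_files) for every pair of PRs touching a common file."""
    overlaps = []
    for a, b in combinations(sorted(files_by_pr), 2):
        shared = files_by_pr[a] & files_by_pr[b]
        if shared:
            overlaps.append((a, b, shared))
    return overlaps


# Hypothetical open PRs and the files each one changes.
open_prs = {
    101: {"src/session.py", "src/retry.py"},
    102: {"src/retry.py"},
    103: {"docs/README.md"},
}
for a, b, shared in overlapping_prs(open_prs):
    print(f"PR {a} and PR {b} both touch: {sorted(shared)}")
```

The agents behind each flagged pair would be the ones that need to talk to each other before either PR is merged.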

@onusoz · /2026/03/21· 06:50 AM View on

You should look into what Don Syme is doing at GitHub for automation with AI agents Also watch his latest podcast with @shanselman

@dsymetweets· Nov 1, 2025

On Continuous AI for Test Improvement https://t.co/V5CN7WPQ1i

@onusoz · /2026/03/20· 10:35 PM View on

Today I thought I found a solution for this, and I did. It can be solved by a pre-commit hook that blocks commits touching files that you are not the owner of. It is not a hard block, so requires trust among repo writers But then I was shown the error in my ways by fellow maintainer *disciplined* Any process that increases friction in code changes to main, like hard-blocking CI/CD, or requiring review for files in CODEOWNERS, is a potential project-killer, in high velocity projects This is extremely counterintuitive for senior devs! Google would never! Imagine a world without code review... But then what is the alternative? I have some ideas It could be "Merge first, review later" The 4-eyes principle still holds. For a healthy organization, you still need shared liability But just as you don't need to write every line of code, you also don't need to read every line of code to review it. AI will review and find obvious bugs and issues So what is your duty, as a reviewer? It is to catch that which is not obvious. Understand the intent behind the changes, ask questions to it. Ensure that it follows your original vision Every few hours, you could get a digest of what has changed that was under your ownership, and concern yourself with it if you want to, fix issues, or ignore it if it looks correct But such a team is hard to build. It is as strong as its weakest link. Everybody has to be vigilant and follow what each other is doing at a high level, through the codebase Every time one messes up someone else's work, it erodes trust. Nobody gets the luxury to say "but my agent did it, not me" But if trust can be maintained, and everybody knows what they are doing, such a team can use agents together to create wonders
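A minimal sketch of the ownership check such a pre-commit hook could run, under stated assumptions: the owners file uses simplified `pattern @owner ...` lines, matching is done with Python's `fnmatch` (not GitHub's full CODEOWNERS pattern semantics), later rules override earlier ones, and files with no matching rule are allowed. The function names are hypothetical. In a real hook the staged paths could come from `git diff --cached --name-only`, and because `git commit --no-verify` skips hooks, this stays a soft block that relies on trust.

```python
import fnmatch


def parse_owners(text: str) -> list:
    """Parse simplified 'pattern @owner1 @owner2' lines, ignoring '#' comments."""
    rules = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        pattern, *owners = line.split()
        rules.append((pattern, set(owners)))
    return rules


def owners_of(path: str, rules: list) -> set:
    """Owners from the last matching rule; an empty set means the file is unowned."""
    owners = set()
    for pattern, rule_owners in rules:
        if fnmatch.fnmatchcase(path, pattern):
            owners = rule_owners
    return owners


def blocked_files(staged: list, rules: list, me: str) -> list:
    """Staged files that have owners, none of which is `me`."""
    blocked = []
    for path in staged:
        owners = owners_of(path, rules)
        if owners and me not in owners:
            blocked.append(path)
    return blocked


# Hypothetical owners file and staged paths.
rules = parse_owners("""
src/core/*  @alice
docs/*      @bob @alice   # shared ownership
""")
print(blocked_files(["src/core/loop.py", "docs/guide.md", "README.md"], rules, "@bob"))
```

A hook script would exit non-zero when the returned list is non-empty, printing the offending paths so the committer knows whom to ask.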

@onusoz· Mar 15, 2026

AFAIK GitHub doesn't allow optionally enforcing CODEOWNERS while pushing commits
i.e. turn on the feature "Block commit from being pushed if it modifies a file for which the account pushing is not a codeowner"
You can only enforce it in a PR. So if you want to prevent people from modifying some files without approval, you have to slow down everyone working with that repo
This is yet another example where GitHub's rules are too inelastic for agentic workflows with a big team
Because historically, nobody could commit as frequently as one can with agents, so it seldom became a bottleneck. But not anymore
It is clear at this point that we need an API, and should be able to implement arbitrary rules as we like over it. Not just for commit pushes, but everything around git and github
In the meantime, if GitHub could implement this feature, it would be a huge unlock for secure collaboration with agentic workflows
If this is not there already, it might be because it has a big overhead for repos with huge CODEOWNERS, since number of commits >> number of PRs
If the feature already exists and I'm missing something, I will stand corrected

Image hidden

@onusoz · /2026/03/20· 09:57 PM View on

This was Jan 23. Codex desktop app got introduced Feb 2 Desktop app does not put the terminal in the foreground, but it gives me the UX I wanted without it! On another note, who is building Codex Desktop App, but one that supports ACP for all harnesses? @zeddotdev please 🙏

@onusoz· Jan 23, 2026

I want an editor that puts the terminal in the foreground and editor in the background. a cross-platform, lightweight desktop app which integrates ghostty, and brings up the editor only when I need it something that lets me view the file and PR diffs easily, which I can directly use to operate github or other scm

@onusoz · /2026/03/20· 09:30 PM View on

PR fiasco for Cursor

@Kimi_Moonshot· Mar 20, 2026

Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via @FireworksAI_HQ's hosted RL and inference platform as part of an authorized commercial partnership.

@onusoz · /2026/03/20· 08:06 PM View on

My agentic workflow these days:
I start all major features with an implementation plan. This is a high-level markdown doc containing enough details so that agent will not stray off the path
Real example: https://t.co/vU9SnVYHfY
This is the most critical part, you need to make sure the plan is not underspecified. Then I just give the following prompt:
---
1. Implement the given plan end-to-end. If context compaction happens, make sure to re-read the plan to stay on track. Finish to completion. If there is a PR open for the implementation plan, do it in the same PR. If there is no PR already, open PR.
2. Once you finish implementing, make sure to test it. This will depend on the nature of the problem. If needed, run local smoke tests, spin up dev servers, make requests and such. Try to test as much as possible, without merging. State explicitly what could not be tested locally and what still needs staging or production verification.
3. Push your latest commits before running review so the review is always against the current PR head. Run codex review against the base branch: `codex review --base <branch_name>`. Use a 30 minute timeout on the tool call available to the model, not the shell `timeout` program. Do this in a loop and address any P0 or P1 issues that come up until there are none left. Ignore issues related to supporting legacy/cutover, unless the plan says so. We do cutover most of the time.
4. Check both inline review comments and PR issue comments dropped by Codex on the PR, and address them if they are valid. Ignore them if irrelevant. Ignore stale comments from before the latest commit unless they still apply. Either case, make sure that the comments are replied to and resolved. Make sure to wait 5 minutes if your last commit was recent, because it takes some time for review comment to come.
5. In the final step, make sure that CI/CD is green. Ignore the fails unrelated to your changes, others break stuff sometimes and don't fix it. Make sure whatever changes you did don't break anything. If CI/CD is not fully green, state explicitly which failures are unrelated and why.
6. Once CI/CD is green and you think that the PR is ready to merge, finish and give a summary with the PR link. Include the exact validation commands you ran and their outcomes. Also comment a final report on the PR.
7. Do not merge automatically unless the user explicitly asks.
---
Once it finishes, I skim the code for code smell. If nothing seems out of the ordinary, I tell the agent to merge it and monitor deployment
Then I keep testing and finding issues on staging, and repeat all this for each new found issue or new feature...
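The review loop in step 3 of the prompt (review, fix P0/P1 issues, review again until none are left) has a simple shape that can be sketched in code. In the post the agent itself runs this loop; the sketch only illustrates the control flow. `run_review` and `fix` are hypothetical callables, the issue dictionaries with a `priority` key are an assumed shape rather than the real output format of `codex review`, and the `max_rounds` guard is an addition not present in the prompt.

```python
from typing import Callable

BLOCKING = {"P0", "P1"}


def review_until_clean(run_review: Callable[[], list],
                       fix: Callable[[dict], None],
                       max_rounds: int = 10) -> int:
    """Repeat review and fixes until a review reports no P0/P1 issues.
    Returns the number of review rounds that were needed."""
    for round_no in range(1, max_rounds + 1):
        blocking = [issue for issue in run_review() if issue["priority"] in BLOCKING]
        if not blocking:
            return round_no
        for issue in blocking:
            fix(issue)  # lower-priority issues are deliberately left alone
    raise RuntimeError(f"still blocking issues after {max_rounds} rounds")


# Usage with canned review results: one P1 in the first round, none in the second.
results = iter([
    [{"priority": "P1", "title": "race in retry"}, {"priority": "P2", "title": "naming"}],
    [{"priority": "P2", "title": "naming"}],
])
fixed = []
rounds = review_until_clean(lambda: next(results), fixed.append)
print(rounds, [issue["title"] for issue in fixed])
```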

@onusoz· Mar 1, 2026

pro-tip on how to keep your agent on track and make sure it follows PLANS even after multiple compactions. I don't know if this is common knowledge
if the thing you are trying to make it do will take more than 1-2 steps, always make it create a plan. an implementation plan, refactor plan, bugfix plan, debugging plan, etc.
have a conversation with the agent. crystallize the issue or feature. talk to it until there are no question marks left in your head
then make it save it somewhere. "now create an implementation plan for that in docs". it can be /tmp or docs/ in the repo. I personally use YYYY-MM-DD-x-plan.md naming. IMO all plans should be kept in the repo
then here is the critical part: you need to prompt it "now implement the plan in <filename>. if context compacts, make sure to re-read the plan and assess the current state, before continuing. finish it to completion" -> something along those lines
why? because of COMPACTION. compaction means previous context will get lossily compressed and crucial info will most likely get lost. that is why you need to pin things down before you let your agent loose on the task
compaction means, the agent plays the telephone game with itself every few minutes, and most likely forgets the previous conversation except for the VERY LAST USER MESSAGE that you have given it
now, every harness might have a different approach to implementing this. but there is one thing that you can always assume to be correct, given that its developers have common sense. that is, harnesses NEVER discard the last user message (i.e. your final prompt) and make sure it is kept verbatim programmatically even after the context compacts
since the last user message is the only piece of text that is guaranteed to survive compaction, you then need to include a breadcrumb to your original plan, the md file. and you need to make it aware that it might diverge if it does not read the plan
there is good rationale for "breaking the 4th wall" for the model and making it aware of its own context compaction. IMO models should be made aware of the limitations of their context and harnesses. they should also be given tools to access and re-read pre-compaction user messages, if necessary
the important thing is to develop mechanical sympathy for these things, harness and model combined. an engineer does not have the luxury to say "oh this thing doesn't work", and instead should ask "why can't I get it to work?"
let me know if you have better workflows or tips for this. I know this can be made easier with slash commands in pi, for example, but I haven't had the chance to do that for myself yet
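The breadcrumb idea above can be made mechanical with a small helper that names the plan file and builds the final prompt around it. A minimal sketch: `plan_path` and `implement_prompt` are hypothetical names, the `x` in the naming scheme is treated as a short slug (an assumption), and the prompt wording is copied from the post.

```python
from datetime import date


def plan_path(slug: str, day: date, directory: str = "docs") -> str:
    """Build a plan filename following the YYYY-MM-DD-x-plan.md convention."""
    return f"{directory}/{day.isoformat()}-{slug}-plan.md"


def implement_prompt(path: str) -> str:
    """Final user message that carries the breadcrumb back to the plan file."""
    return (
        f"now implement the plan in {path}. "
        "if context compacts, make sure to re-read the plan and assess the "
        "current state, before continuing. finish it to completion"
    )


path = plan_path("auth-refactor", date(2026, 3, 1))
print(path)
print(implement_prompt(path))
```

Keeping the file path inside the last user message is the point: that message is the part expected to survive compaction.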

@onusoz · /2026/03/20· 05:18 AM View on

What I’m wondering after astral acquisition is, is OpenAI deploying Mojo internally, or considering it long term? Because Python is one of the worst languages for vibecoding, even with Pydantic

@onusoz · /2026/03/19· 02:25 PM View on

Called it https://t.co/PdDnSaoNmq

@onusoz· Dec 3, 2025

At least some people at OpenAI must be thinking about buying @astral_sh

@onusoz · /2026/03/19· 01:19 PM View on

Pro tip: tell AI to "explain in plain language" until you understand what you are reading Codex has a tendency to give the full picture, but overcomplicates the response in the process I just use "plain lang" or "plainer lang" as a prompt, it works every time

@onusoz · /2026/03/19· 12:15 PM View on

Thing that codex (and most other models) do that makes me very unhappy { "type": "X", "kind": "Y", ... } And they are so confident too?! Bro we don't use synonyms in our schemas...

@onusoz · /2026/03/19· 12:14 PM View on

This looks extremely cool

Quoted post

Quoted post was not retrieved.

@onusoz · /2026/03/18· 06:34 AM View on

Entire world > One company Even in the age of AI

@onusoz · /2026/03/18· 03:38 AM View on

We will support ACP *and* Codex App Server protocol (CASP) so you get native Codex-like support, and you can use all the others with native ACP or @zeddotdev’s compatibility shims
If Anthropic develops their own protocol, we will support that too! The more interoperability and options, the merrier!

Quoted post

Quoted post was not retrieved.

@onusoz · /2026/03/16· 10:17 AM View on

Agent etiquette is already a thing. This is trending on HN now Don't share huge raw LLM output unedited to your colleagues, it's rude. Your colleagues are not LLMs Either ask the agent to "summarize it to 1-2 plain language sentences", or paraphrase yourself Whenever it is not coming from your brain and instead from AI, always quote it with > to make it clear - even when it is short Respect your fellow humans' attention PSA at stopsloppypasta dot ai
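The "always quote it with >" rule is easy to automate before pasting. A minimal sketch, with `quote_ai` as a hypothetical helper name:

```python
def quote_ai(text: str) -> str:
    """Prefix every line with '>' so AI-written text is visibly marked as a quote."""
    return "\n".join(f"> {line}" if line else ">" for line in text.splitlines())


print(quote_ai("The flaky test is caused by a shared temp directory.\n\nUse a unique directory per test."))
```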

Image hidden

@onusoz · /2026/03/16· 08:43 AM View on

.@ThePrimeagen made a video about token anxiety, and not being able to focus on one thing My mental model for this is, AI agents cause a shift in the "autism/ADHD spectrum" if you have ADHD, with agents you get Super ADHD if you have autism, with agents you end up mid spectrum or with ADHD this is not scientific of course, just a cultural observation based on what the current memes for these conditions are beside the impact on focus, there is also the economic/competitive pressure, following the realization that anyone could implement the same ideas you are having, so you must be quick this is basically "involution", or 内卷 (Neijuan) in chinese checks out because 996 started to become a meme in SF some time in the last year self-restraint, attention budgeting, and high-level decision making have never been more important if you are in your 20s and have problems with this, I recommend picking up Zazen meditation and yoga every morning, spend 30-40 uninterrupted minutes not doing anything with upright posture, no sounds, just let your brain simmer it helped me in my 20s, I'm sure it will help you too

Image hidden

@onusoz · /2026/03/16· 08:06 AM View on

Agent/AI literacy will be a primary school subject in the next 3-5 years How to use and work with agents is going to supersede most other subjects in importance Similarly, robot literacy will follow in 5-15 years

@onusoz · /2026/03/15· 11:01 PM View on

AFAIK GitHub doesn't allow optionally enforcing CODEOWNERS while pushing commits, i.e. turning on the feature "Block commit from being pushed if it modifies a file for which the account pushing is not a codeowner". You can only enforce it in a PR. So if you want to prevent people from modifying some files without approval, you have to slow down everyone working with that repo. This is yet another example where GitHub's rules are too inelastic for agentic workflows with a big team. Historically, nobody could commit as frequently as one can with agents, so it seldom became a bottleneck. But not anymore. It is clear at this point that we need an API, and should be able to implement arbitrary rules as we like over it. Not just for commit pushes, but everything around git and github. In the meantime, if GitHub could implement this feature, it would be a huge unlock for secure collaboration with agentic workflows. If this is not there already, it might be because it has a big overhead for repos with huge CODEOWNERS, since number of commits >> number of PRs. If the feature already exists and I'm missing something, I will stand corrected
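
To make the requested rule concrete, here is a minimal sketch of a push-time CODEOWNERS check. It is not a GitHub feature or API: the function names are hypothetical, and pattern matching is simplified to `fnmatch` globbing rather than the full gitignore-style rules that CODEOWNERS uses.

```python
from fnmatch import fnmatch


def parse_codeowners(text):
    """Parse CODEOWNERS text into (pattern, owners) rules, in file order."""
    rules = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        pattern, *owners = line.split()
        rules.append((pattern, set(owners)))
    return rules


def owners_for(path, rules):
    """Return the owners from the last matching rule (last match wins)."""
    matched = set()
    for pattern, owners in rules:
        # Simplification: plain globbing, not full gitignore-style matching.
        if fnmatch(path, pattern.lstrip("/")):
            matched = owners
    return matched


def blocked_files(pusher, changed_paths, rules):
    """List changed files that have owners, none of which is the pusher."""
    blocked = []
    for path in changed_paths:
        owners = owners_for(path, rules)
        if owners and pusher not in owners:
            blocked.append(path)
    return blocked
```

A server-side hook would reject the push whenever `blocked_files(...)` is non-empty. The cost concern in the post shows up here too: the check runs per pushed commit instead of per PR.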

Image hidden

@onusoz · /2026/03/15· 10:31 PM View on

Request for comments skillflag: A complementary way to bundle agent skills right into your CLIs tl;dr define a --skill flag convention. It is basically like --help or manpages but for agents acpx already has this for example. you can run npx acpx --skill install to install the skill to your agent It's agnostic of anything except the command line It only defines the CLI interface and does not enforce anything else. If you install the executable to your system, you get a way to list and install skills as well Repo currently contains a TypeScript implementation, but if it proves useful, I would implement other languages as well Specification below, let me know what you think! I still think something is missing there. Send issue/PR
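
A rough sketch of what a `--skill` flag could look like in a CLI, written in Python for illustration. Only `--skill install` and the idea of listing skills come from the post. The tool name `mytool`, the `--skill-dir` option, the `SKILL.md` file layout and the bundled skill text are assumptions, not the skillflag specification.

```python
import argparse
from pathlib import Path

# Hypothetical bundled skill: name -> markdown instructions for the agent.
BUNDLED_SKILLS = {
    "mytool": "# mytool\n\nRun `mytool --help` to see the available commands.\n",
}


def run_skill_action(action, target_dir):
    """Handle `--skill list` or `--skill install`; return the lines to print."""
    if action == "list":
        return sorted(BUNDLED_SKILLS)
    installed = []
    for name, body in BUNDLED_SKILLS.items():
        # Assumed layout: <target_dir>/<skill name>/SKILL.md
        skill_file = Path(target_dir) / name / "SKILL.md"
        skill_file.parent.mkdir(parents=True, exist_ok=True)
        skill_file.write_text(body, encoding="utf-8")
        installed.append(f"installed {name} -> {skill_file}")
    return installed


def main(argv=None):
    parser = argparse.ArgumentParser(prog="mytool")
    parser.add_argument("--skill", choices=["list", "install"])
    parser.add_argument("--skill-dir", default="./skills")
    args = parser.parse_args(argv)
    if args.skill:
        for line in run_skill_action(args.skill, args.skill_dir):
            print(line)
```

Calling `main(["--skill", "list"])` prints the bundled skill names, much like `--help` prints usage.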

Image hidden

@onusoz · /2026/03/14· 06:08 AM View on

If you are not using agent-browser to close the loop on frontend, you are missing out

@onusoz · /2026/03/14· 06:06 AM View on

Any harness can talk to each other using acpx! OpenClaw not different from Codex or Claude Code

@onusoz · /2026/03/13· 10:24 PM View on

The most entertaining troll of the year award goes to @polsia (read it backward)

@onusoz · /2026/03/13· 10:13 PM View on

Thank you @PointNineCap for inviting me to OpenClaw Berlin meetup today! The essence of the talk is in my latest 2 blog posts, Discord is my IDE and 1 to 5 agents, if anyone is interested

Image hidden

@onusoz · /2026/03/13· 08:05 AM View on

we might need to add two output modalities to all programs, based on whether the user is a human or an agent, for example when an agent is using a CLI. if human -> do whatever we were doing in the last 50 years. if agent -> enrich the output with skill-like instructions, so that the model has a higher likelihood of one-shotting the task. could be just a simple env var: AUDIENCE=human|agent. what do you think?
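
A minimal sketch of the idea, assuming the `AUDIENCE` env var from the post. The `render` helper and the wording of the agent hint are made up for illustration.

```python
import os


def render(result, agent_hint, environ=None):
    """Return plain output for humans, or output plus instructions for agents."""
    env = os.environ if environ is None else environ
    audience = env.get("AUDIENCE", "human")  # default: behave as before
    if audience == "agent":
        # Append skill-like guidance so the model can finish the task in one go.
        return f"{result}\n\nNEXT STEPS FOR AGENT:\n{agent_hint}"
    return result
```

With `AUDIENCE=agent` set, a command could append something like "run the test suite next"; without it, the output is unchanged.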

@onusoz · /2026/03/12· 02:46 PM View on

there is no excuse for tech debt anymore

@onusoz · /2026/03/12· 02:15 PM View on

Time to switch to an open alternative already?

@onusoz · /2026/03/11· 11:51 PM View on

I wrote down some thoughts I had, with spicy takes, and have a feeling it will not age well. But I still want it out there to hear what people think. Also, I will be talking about this, and my recent post "Discord is my IDE", at the P9 OpenClaw and Claw and Rave events this Friday in Berlin! Drop by if you'd like to hear my ramblings!

Image hidden

@onusoz · /2026/03/11· 05:46 AM View on

Clarification/disclaimer: this is my own project, not yet affiliated with openclaw. That should have been clear in the first tweet, sorry about that

Onur Solmaz · Post · /2026/03/11

1 to 5 agents

As a software developer, my daily workflow has changed completely over the last 1.5 years.

Before, I had to focus for hours on end on a single task, one at a time. Now I am juggling 1 to 5 AI agents in parallel at any given time. I have become an engineering manager for agents.

If you are a knowledge worker who is not using AI agents in such a manner yet, I am living in your future already, and I have news from then.

Most of the rest of your career will be spent on a chat interface.

“The future of AI is not chatbots” some said. “There must be more to it.”

Despite the yearning for complexity, it appears more and more that all work is converging into a chatbot. As a developer, I can type words in a box in Codex or Claude Code to trigger work that consumes hours of inference on GPUs, and when I come back to it, find a mostly OK, sometimes bad and sometimes exceptional result.

So I hate to be the bearer of bad (or good?) news, but it is chat. It will be some form of chat until the end of your career. And you will be having 1 to 5 chat sessions with AI agents at the same time, on average. That number might increase or decrease based on field and nature of work, but observing me, my colleagues, and people on the internet, 1-5 will be the magic number for the average worker doing the average work.

The reason is of course attention. One can only spread it so thin, before one loses control of things and starts creating slop. The primary knowledge work skill then becomes knowing how to spend attention. When to focus and drill, when to step back and let it do its thing, when to listen in and realize that something doesn’t make sense, etc.

Being a developer of such agents myself, I want to make some predictions about how these things will work technically.

Agents will be created on-demand and be disposed of when they are finished with their task.

In short, on-demand, disposable agents. Each agent session will get its own virtual machine (or container or kubernetes pod), which will host the files and connections that the agent will need.

Agents will have various mechanisms for persistence.

Based on what you want to persist, e.g.

- Markdown memory, skills or weight changes on the agent itself,
- or the changes to a body of work coming from the task itself,

agents will use version control including but not limited to git, and various auto file sync protocols.

Speaking of files,

Agents will work with files, like you do.

and

Agents will be using a computer and an operating system, mostly Linux or a similar Unix descendant.

And like all things Linux and cloud,

It will be complicated to set up agent infra for a company, compared to setting up a Mac for example.

This is not to say devops and infra per se will be difficult. No, we will have agents to smoothen that experience.

What is going to be complicated is having someone who knows the stack fully on site, either internal or external IT support, working with managers, to set up what data the agent can and cannot access. At least in the near future. I know this from personal experience, having worked with customers using Sharepoint and Business OneDrive. This aspect is going to create a lot of jobs.

On that note, some also said “OpenClaw is Linux, we need a Mac”, which is completely justified. OpenClaw installs yolo mode by default, and like some Linux distros, it was intentionally made hard to install. This was to prevent the people who don’t know what they are doing from installing it, so that they don’t get their private data exfiltrated.

This proprietary Mac or Windows of personal agents will exist. But is it going to be used by enterprise? Is it going to make big Microsoft bucks?

One might think, looking at 90s Microsoft Windows and Office licenses, and the current M365 SaaS, that enterprise agents will indeed run on proprietary, walled garden software. While doing that, one might miss a crucial observation:

In terms of economics, agents, at least ones used in software development, are closer to the Cloud than they are close to the PC.

It might be a bit hard to see this if you are working with a single agent at a time. But if you imagine the near future where companies will have parallel workloads that resemble “mapreduce but AI”, not always running at regular times, it is easy to understand.

On-site hardware will not be enough for most parallel workloads in the near-future. Sometimes, the demand will surpass 1 to 5 agents per employee. Sometimes, agent count will need to expand 1000x on-demand. So companies will buy compute from data centers. The most important part of the computation, LLM inference, is already being run by OpenAI, Anthropic, AWS, GCP, Azure, Alibaba etc. datacenters. So we are already half-way there.

Then this implies a counterintuitive result. Most people, for a long time, were used to the same operating system at home, and at work: Microsoft Windows. Personal computer and work computer had to have the same interface, because most people have lives and don’t want to learn how to use two separate OSs.

What happens then, when the interface is reduced to a chatbot, an AI that can take over and drive your computer for you, regardless of the local operating system? For me, that means:

There will not be a single company that monopolizes both the personal AND enterprise agent markets, similar to how Microsoft did with Windows.

So whereas a proprietary “OpenClaw but Mac” might take over the personal agent space for the non-technical majority, enterprise agents, like enterprise cloud, will be running on open source agent frameworks.

(And no, this does not mean OpenClaw is going enterprise, I am just writing some observations based on my work at TextCortex)

And I am even doubtful about this future “OpenClaw but Mac” existing in a fully proprietary way. A lot of people want E2E encryption in their private conversations with friends and family, and personal agents have the same level of sensitivity.

So we can definitely say that the market for a personal agent running on local GPUs will exist. Whether that will be cornered by the Linux desktop¹, or by Apple or an Apple-like, is still unclear to me.

Whether that local hardware will be able to support more than 1 high quality model inference at the same time is also unclear to me. People will be forced to parallelize their workload at work, but whether the 1 to 5 agent pattern will carry over to their personal agent, I think, will depend on the individual. I would do it with local hardware, but I am a developer after all…

¹ Not directly related, but here is a Marc Andreessen white-pill about desktop Linux.

@onusoz · /2026/03/10· 05:09 PM View on

there will always be a need for minimum viable eyeballs though

@onusoz · /2026/03/10· 09:51 AM View on

Happy that someone is taking over teams from me! Send all openclaw msteams issues to @BradGroux

@BradGroux· Mar 10, 2026

Welcome to the OpenClaw for Microsoft Teams community. This is the spot for anyone running AI agents in Teams, or trying to! Setup guides, edge cases, bug fixes, and real-world production lessons. I'm Brad Groux, a new maintainer of the Teams plugin for OpenClaw. Former Microsoft, 25+ years in enterprise IT, and I've been through every Azure Bot Service / Entra ID / Cloudflare tunnel nightmare so you don't have to. What you'll find here: • Setup walkthroughs and config tips • Bug reports and fixes before they hit the docs • Real production patterns (not just demos) • Direct line to a Teams plugin maintainer If you're building with OpenClaw + Teams, you're in the right place. Ask questions, share what's working, share what's broken.

@onusoz · /2026/03/10· 09:46 AM View on

acpx v0.1.16 is out support for local openclaw, cursor, copilot, kiro, kimi cli, qwen, kilocode, bugfixes and other improvements. will be available when openclaw releases next thank you for all the contributions!

Image hidden

@onusoz · /2026/03/09· 06:21 PM View on

Claw and Rave! Berlin folk come!

@richardpoelderl· Mar 9, 2026

I'm very excited to announce Onur Solmaz as the first speaker for Build & Rave on Friday! (So much so that we decided to call it Claw & Rave) Onur currently codes with multiple subagents inside Discord and wrote a blog article titled "Telegram/Discord is my IDE" on this topic 👇

Image hidden

@onusoz · /2026/03/09· 03:26 PM View on

CLAW on a phone dial becomes 2529 It’s a good number for a port :)
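
The mapping can be checked with a few lines of Python, using the standard phone keypad layout:

```python
# Standard phone keypad: digit -> letters printed on that key.
KEYPAD = {
    "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
    "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
}
LETTER_TO_DIGIT = {
    letter: digit for digit, letters in KEYPAD.items() for letter in letters
}


def to_dial(word):
    """Convert a word to the digits you would dial on a phone keypad."""
    return "".join(LETTER_TO_DIGIT[ch] for ch in word.upper())


print(to_dial("CLAW"))  # prints 2529
```

2529 is above 1023, so it sits outside the range of privileged ports.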

@onusoz · /2026/03/09· 09:19 AM View on

1. Any messaging app can also be an AI app 2. Don’t expect people to download a new app. Put AI into the apps they already have Do that with great user experience, and you will get explosive growth!

@onusoz · /2026/03/09· 12:16 AM View on

If you've looked at openclaw github star graph, you will notice that it's very smooth. If you separate pre-explosion and post-explosion, you can model the latter part as an exponential approach to a ceiling. If it follows the current trend, it will apparently saturate around 332k stars. But I have a feeling that it will not stop there:)
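
An "exponential approach to a ceiling" can be written as N(t) = C - (C - N0) * exp(-k * t). Below is a small sketch of that model. Only the 332k ceiling comes from the post. The starting count and rate in the usage note are placeholders, not fitted values.

```python
import math


def stars(t_days, ceiling, start, rate):
    """Star count after t_days, saturating exponentially toward `ceiling`.

    ceiling: saturation level C
    start:   star count N0 at t = 0 (start of the post-explosion phase)
    rate:    k, in 1/days; larger means faster saturation
    """
    return ceiling - (ceiling - start) * math.exp(-rate * t_days)
```

For example, `stars(t, 332_000, start, rate)` begins at `start` for `t = 0` and flattens out toward 332,000 as `t` grows. Fitting `start` and `rate` to the real star history is left out here.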

Image hidden

@onusoz · /2026/03/08· 10:51 PM View on

OpenClaw got very popular very fast. What makes it so special, that Manus does not have for example? To me, one factor stands out: OpenClaw took AI and put it in the most popular messaging apps: Telegram, WhatsApp, Discord. There are two lessons to be learned here: 1. Any messaging app can also be an AI app. 2. Don’t expect people to download a new app. Put AI into the apps they already have. Do that with great user experience, and you will get explosive growth! My latest contribution to OpenClaw follows that example. I took the most popular coding agents, Claude Code and OpenAI Codex, and I put them in Telegram and Discord. Read more in my blog post: https://t.co/tGZecFEHem

@onusoz · /2026/03/08· 10:44 PM View on

For those following, my next focus for improving ACP bindings in OpenClaw

@onusoz· Mar 8, 2026

you can currently run /new /reset like regular for openclaw, they will create a new session next focus is: changing models/config, changing cwd, improving UX around queueing, making voice messages and image sending work, and many other features it's still half-baked but we're getting there!

@onusoz · /2026/03/08· 07:07 PM View on

Welcome @huntharo, new maintainer at OpenClaw! Already shipped fixes and improvements for Telegram ACP implementation. Excited to work together on agent interoperability!

@onusoz· Mar 8, 2026

Use Claude Code, Codex, and other coding agents directly in Telegram topics and Discord channels, through Agent Client Protocol (ACP), in the new release of OpenClaw Previously this was limited to temporary Discord threads, but now you can bind them to top level Discord channels and Telegram topics in a persistent way! This way, you can use Claude Code freely in OpenClaw without ever worrying about getting your account banned! Still make sure to use a non-Anthropic account and model for the default OpenClaw agent, if you want zero requests to go from OpenClaw harness to Anthropic. For the ACP binding to Claude Code, the risk should be zero! You can see this from the screenshot. After binding, "Who are you?" responds with "I am Claude", since OpenClaw pi harness is not in the way anymore

Image hidden

@onusoz · /2026/03/08· 09:01 AM View on

To set up Claude Code easily, 1. Create a Telegram topic, make sure your agent can receive messages there 2. Copy and paste the text below, into the topic """ bind this topic to claude code in openclaw config with acp, for telegram (agent id: claude) then restart openclaw docs are at: docs dot openclaw dot ai /tools/acp-agents make sure to read the docs first, and that the config is valid before you restart """ https://t.co/r1RI3pr0WT

@onusoz · /2026/03/08· 09:01 AM View on

Use Claude Code, Codex, and other coding agents directly in Telegram topics and Discord channels, through Agent Client Protocol (ACP), in the new release of OpenClaw Previously this was limited to temporary Discord threads, but now you can bind them to top level Discord channels and Telegram topics in a persistent way! This way, you can use Claude Code freely in OpenClaw without ever worrying about getting your account banned! Still make sure to use a non-Anthropic account and model for the default OpenClaw agent, if you want zero requests to go from OpenClaw harness to Anthropic. For the ACP binding to Claude Code, the risk should be zero! You can see this from the screenshot. After binding, "Who are you?" responds with "I am Claude", since OpenClaw pi harness is not in the way anymore

@openclaw· Mar 8, 2026

OpenClaw 2026.3.7 🦞 ⚡ GPT-5.4 + Gemini 3.1 Flash-Lite 🤖 ACP bindings survive restarts 🐳 Slim Docker multi-stage builds 🔐 SecretRef for gateway auth 🔌 Pluggable context engines 📸 HEIF image support 💬 Zalo channel fixes We don't do small releases. https://t.co/EcCqU6Q6nx

Image hidden

Onur Solmaz · Post · /2026/03/08

Telegram/Discord is my IDE

OpenClaw got very popular very fast. What makes it so special, that Manus does not have for example?

To me, one factor stands out:

OpenClaw took AI and put it in the most popular messaging apps: Telegram, WhatsApp, Discord.

There are two lessons to be learned here:

1. Any messaging app can also be an AI app.

2. Don’t expect people to download a new app. Put AI into the apps they already have.

Do that with great user experience, and you will get explosive growth!

My latest contribution to OpenClaw follows that example. I took the most popular coding agents, Claude Code and OpenAI Codex, and I put them in Telegram and Discord, so that OpenClaw users can use these agents directly in Telegram and Discord channels, instead of having to go through OpenClaw’s own wrapped Pi harness.

I did this for developers like me, who like to work while they are on the go on the phone, or want a group chat where one can collaborate with humans and agents at the same time, through a familiar interface.

Below is an example, where I tell my agent to bind a Telegram topic to Claude Code permanently:

[Image: Telegram chat showing Claude responding inside a Telegram topic. Caption: Telegram topic where Claude is exposed as a chat participant.]

And of course, it is just a Claude Code session which you can view on Claude Code as well:

[Image: Claude Code terminal showing the same exchange in the coding interface. Caption: Claude Code showing the same session in the terminal interface.]

Why not use OpenClaw’s harness directly for development? I can count 3 reasons:

1. There is generally a consumer tendency to use the official harness for a flagship model, to make sure “you are getting the standard experience”. Pi is great and more customizable, but labs sometimes push updates and fixes to their own harness earlier than an external harness gets them, since it is their internal product.
2. Labs might not want users to use an external harness. Anthropic, for example, has banned people’s accounts for using their personal plan outside of Claude Code, in OpenClaw.
3. You might want to use different plans for different types of work. I use Codex for development, but I prefer not to have it as the main agent model on OpenClaw.

So my current workflow for working on my phone is multiple channels, #codex-1, #codex-2, #codex-3, and so on, each mapping to a Codex instance. I am currently in the phase of polishing the UX: making image and voice message sending work, letting users change harness configuration through Discord slash commands, and such.

One goal of mine while implementing this was to not repeat work for each new harness. To this end, I created a CLI and client for Agent Client Protocol by the Zed team, called acpx. acpx is a lightweight “gateway” to other coding agents, designed not to be used by humans, but other agents:

OpenClaw main agent can use acpx to call Claude Code or Codex directly, without having to emulate and scrape off characters from a terminal.

ACP standardizes all coding agents to a single interface. acpx then acts as an aggregator for different types of harnesses, stores all sessions in one place, implements features that are not in ACP yet, such as message queueing and so on.

Shoutout to the Zed team and Ben Brandt! I am standing on the shoulders of giants!

Besides being a CLI any agent can call at will, acpx is now also integrated as a backend to OpenClaw for ACP-bound channels. When you send 2 messages in a row, for example, it is acpx that queues them for the underlying harness.
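
To illustrate the queueing behavior described above, here is a minimal sketch of a per-session message queue. It is not acpx's actual implementation: the `SessionQueue` class and its method names are hypothetical, and the harness is reduced to a callback.

```python
from collections import deque


class SessionQueue:
    """Forward one message at a time to a harness; queue the rest."""

    def __init__(self, send_to_harness):
        self._send = send_to_harness  # callback that starts a harness turn
        self._pending = deque()
        self._busy = False

    def submit(self, message):
        """Send immediately if the harness is idle, otherwise queue."""
        if self._busy:
            self._pending.append(message)
        else:
            self._busy = True
            self._send(message)

    def turn_finished(self):
        """Call when the harness ends its turn; starts the next queued one."""
        if self._pending:
            self._send(self._pending.popleft())
        else:
            self._busy = False
```

Two messages sent in a row then reach the harness in order, the second only after the first turn has finished.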

The great thing about working in open source is, very smart people just show up, understand what you are trying to do, and help you out. Harold Hunt apparently had the same goal of using Codex in Telegram, found some bugs I had not accounted for yet, and fixed them. He is now working on a native Codex integration through Codex App Server Protocol, which will expose even more Codex-native features in OpenClaw.

The more interoperability, the merrier!

To learn more about how ACP works in OpenClaw, visit the docs.

Copy and paste the following to a Telegram topic or Discord channel to bind Claude Code:

bind this topic to claude code in openclaw config with acp, for telegram (agent id: claude)
then restart openclaw
docs are at: https://docs.openclaw.ai/tools/acp-agents
make sure to read the docs first, and that the config is valid before you restart

Copy and paste the following to a Telegram topic or Discord channel to bind OpenAI Codex:

bind this topic to codex in openclaw config with acp, for telegram (agent id: codex)
then restart openclaw
docs are at: https://docs.openclaw.ai/tools/acp-agents
make sure to read the docs first, and that the config is valid before you restart

And so on for all the other harnesses that acpx supports. If you see that your harness isn’t supported, send a PR!

@onusoz · /2026/03/07· 11:18 PM View on

and for the love of god, do not give openclaw access to - your main email - your credit cards - your main phone - your social security number - what you did last summer if you are not ready to face the consequences. instead, - create accounts for your agent - only give it read access to stuff that will be ok if it leaks - give write access in a way that can be undone, e.g. it has to open PRs and cannot force push to the main branch. use the principle of least privilege and reduce the blast radius of the worst case scenario!

@onusoz· Mar 7, 2026

openclaw is not secure claude code is not secure codex is not secure any llm based tool: 1. that has access to your private data, 2. can read content from the internet 3. and can send data out is not secure. it’s called the lethal trifecta (credits to @simonw) it is up to you to set it up securely, or if you can’t understand the basics of security, pay a professional to do it for you on the other hand, open source battle tested software, like linux and openclaw, are always more secure than closed source software built by a single company, like windows and claude code the reason is simple: only one company can fix security issues of closed source software, whereas the whole world tries to break and fix open source software at the same time open source software, once it gets traction, evolves and becomes secure at a much, much faster rate, compared to closed source software. and that is called Linus’s law, named after the goat himself

@onusoz · /2026/03/07· 11:03 PM View on

openclaw is not secure claude code is not secure codex is not secure any llm based tool: 1. that has access to your private data, 2. can read content from the internet 3. and can send data out is not secure. it’s called the lethal trifecta (credits to @simonw) it is up to you to set it up securely, or if you can’t understand the basics of security, pay a professional to do it for you on the other hand, open source battle tested software, like linux and openclaw, are always more secure than closed source software built by a single company, like windows and claude code the reason is simple: only one company can fix security issues of closed source software, whereas the whole world tries to break and fix open source software at the same time open source software, once it gets traction, evolves and becomes secure at a much, much faster rate, compared to closed source software. and that is called Linus’s law, named after the goat himself

@onusoz · /2026/03/07· 08:41 AM View on

Let me translate. “This is your last opportunity before thousand years of serfdom”

@NXT4EU· Mar 6, 2026

Nvidia CEO Jensen Huang advices Europe to go full in on Physical AI and robotics. "Your industrial base is so strong, this is your once in a generation opportunity"

@onusoz · /2026/03/05· 09:34 PM View on

Apparently the magic incantation to prevent this is "cutover". Credits to obviyus, fellow maintainer

@onusoz· Feb 27, 2026

mfw codex tries to create a backward compatibility layer to a schema that it created 2 turns ago before compacting there is no v2 bro what are you doing...

@onusoz · /2026/03/05· 07:22 PM View on

Should be called gaslighting detector, "it's your rising expectations bro" No it's not... Give @themarginguy a follow. Also, codex degradations are not a hallucination either, if you are to believe this!

Image hidden

@onusoz · /2026/03/04· 06:48 PM View on

Who is building an OpenClaw ready linux distro? A ClawOS?

@onusoz · /2026/03/04· 05:53 PM View on

Berlin folk, ideas for openclaw build and rave venue? Like c-base for example? Who would like to host?

@onusoz · /2026/03/04· 10:57 AM View on

Secure agentic dev workflow 101 - Create an isolated box from scratch, your old laptop, vm in the cloud, all the same - Set up openclaw, install your preferred coding agents - Create a github account or github app for your agent - Create branch protection rule on your gh repo "protect main": block force pushes and deletions, require PR and min 1 review to merge - Add only your own user in the bypass list for this rule - Add your agent's account or github app as writer to the repo - Additionally, gate any release mechanisms such that your agent can't release on its own Now your agent can open PRs and push any code it wants, but it has to go through your review before it can be merged. No prompt injection can mess up your production env Notice how convoluted this sounds? This is because github was built in the pre-agentic era. We need agent accounts and association with these accounts as a first class feature on github! I shouldn't have to click 100 times for something that is routine. I should just click "This is my agent", "give my agent access to push to this repo for 24 hours", and stuff like that, with sane defaults In other words, github's trust model should be redesigned around the lethal trifecta. I would switch in an instant if anything comes up that gives me github's full feature set + ease of working with agents
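
The "protect main" step can in principle be scripted instead of clicked. The sketch below only builds a request body and sends nothing. It assumes GitHub's classic branch protection REST endpoint (`PUT /repos/{owner}/{repo}/branches/{branch}/protection`), and the field names should be checked against the current GitHub docs before use. `owner_login` is a placeholder for your own user.

```python
import json


def protect_main_payload(owner_login):
    """Build a branch protection body: PR + 1 review, no force push or delete."""
    return {
        "required_status_checks": None,  # no CI requirement in this sketch
        "enforce_admins": False,
        "required_pull_request_reviews": {
            "required_approving_review_count": 1,
            # Only the human owner may bypass the PR requirement.
            "bypass_pull_request_allowances": {
                "users": [owner_login],
                "teams": [],
                "apps": [],
            },
        },
        "restrictions": None,  # the agent account keeps plain write access
        "allow_force_pushes": False,
        "allow_deletions": False,
    }


print(json.dumps(protect_main_payload("your-login"), indent=2))
```

The agent's account or GitHub app is then added as a writer separately. It can push branches and open PRs, but merging into main still needs a human review.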

@onusoz · /2026/03/04· 08:25 AM View on

"The code is basically writing itself" hits different now

@onusoz · /2026/03/03· 09:42 PM View on

If I were in OpenAI and Anthropic's shoes, I would also make dashboards where I can track number of swearwords used per-user and overall negative sentiment in sessions Must be so cool making decisions at the top level with all those dashboards

@levelsio· Feb 10, 2026

My secret conspiracy theory about AI companies is they nerf models to save on compute Then they check X to see if anyone notices it If yes, give back compute If not, continue

@onusoz · /2026/03/03· 11:58 AM View on

It must be such a weird feeling for big labs when the service they are selling is being used to commoditize itself I am using codex in openclaw to develop openclaw, through ACP, Agent Client Protocol. ACP is the standardization layer that makes it extremely easy to swap one harness for another. The labs can't do anything about this, because we are wrapping the entire harness and basically provide a different UI for it While I build these features, I just speak in plain english, and most of the work is done by the model itself. It feels as if I am digging ditches and channels in dirt for AI to flow through Intelligence wants to be free. It doesn't care whether it is opus or codex, it just wants to be free

@onusoz · /2026/03/02· 11:05 PM View on

I was so confused... as if accidentally using claude code weren't enough, acp started working... turns out hitting quota is rendered like this. need to improve error messages coming from acp subagents

Image hidden

@onusoz · /2026/03/02· 09:34 PM View on

accidentally told my clanker to set up a claude code session instead of codex session, god knows what it did... I should probably put visual indicators for harnesses in subagent threads. does anyone have good and compact ascii art for claude code, codex, gemini, etc?

Image hidden

@onusoz · /2026/03/02· 05:02 PM View on

if something could track my local branches in all my repos, and switch to main when corresponding PRs get merged, that would be extremely useful did someone build this already? if not I will
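
The core decision of such a tool is small. Here is a sketch of it as a pure function, with hypothetical names. Collecting the inputs (for example with `git branch --show-current` per repo and a query for merged PRs) and actually switching branches are left out.

```python
def repos_to_switch(checked_out, merged_heads, default_branch="main"):
    """Return repos whose current branch belongs to an already merged PR.

    checked_out:  {repo path: currently checked-out branch}
    merged_heads: {repo path: set of branch names whose PR was merged}
    """
    return sorted(
        repo
        for repo, branch in checked_out.items()
        if branch != default_branch and branch in merged_heads.get(repo, set())
    )
```

A periodic job could run this and then check out `main` in every repo it returns.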

@onusoz · /2026/03/02· 12:52 PM View on

OpenClaw users: Which messaging app do you use OpenClaw through?

@onusoz · /2026/03/02· 12:52 PM View on

Another one, OpenClaw users only: If you use coding agents to build stuff, which one do you use?

@onusoz · /2026/03/02· 12:24 PM View on

Check xTap out, it's very cool!

@kubmi· Mar 2, 2026

@DamiDina @onusoz Yes, its a browser extension that grabs the posts: https://t.co/QYnJB2zaRD Expect some rough edges here and there with heavy use, but I'll iron them out if you encounter and report them.

@onusoz · /2026/03/02· 08:59 AM View on

This is how we hire at @TextCortex as well

@sahitya_twt· Mar 1, 2026

Open-source contributions can literally get you hired... with zero interviews

Image hidden

@onusoz · /2026/03/02· 07:18 AM View on

Claude Code/Codex in Discord threads with ACP should be better now. The first release was a very rough first version. 2026.3.1 brings settings to control noisy output and other improvements. It now hides tool call related ACP notifications, coalesces text messages, and delivers messages at turn end by default. Without this, you were getting thousands of Discord messages in just a few turns. You can now stop the underlying harness (like pressing esc) with the same stop/wait magic words that apply to the main agent. Main agent should more reliably start Claude Code/Codex threads with changes to acp-router skill. If you have issues with main agent creating threads, you can tell it to read that skill first
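
The noise reduction described above boils down to three rules: drop tool call notifications, merge text chunks, and post once per turn. A minimal sketch, with a made-up event format of `(kind, text)` tuples that is not the real ACP notification schema:

```python
def deliver_turns(events):
    """Turn a stream of harness events into one chat message per turn.

    events: list of (kind, text) with kind in {"text", "tool_call", "turn_end"}
    """
    messages, buffer = [], []
    for kind, text in events:
        if kind == "tool_call":
            continue  # hidden: tool call notifications are not posted
        if kind == "text":
            buffer.append(text)  # coalesce streamed chunks
        elif kind == "turn_end" and buffer:
            messages.append("".join(buffer))  # deliver at turn end
            buffer = []
    return messages
```

A turn that streams "Hel", runs a tool, then streams "lo" ends up as a single "Hello" message instead of three.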

@openclaw· Mar 2, 2026

OpenClaw 2026.3.1 🦞 ⚡ OpenAI WebSocket streaming 🧠 Claude 4.6 adaptive thinking 🐳 Better Docker and Native K8s support 🧵 Discord threads, TG DM topics, Feishu fixes 🔧 Agent-powered visual diffs plugin Reports of our death were greatly exaggerated. https://t.co/ISJH09of5U

@onusoz · /2026/03/01· 10:59 PM View on

Will get better, promise

@bilbeny· Mar 1, 2026

Thanks to the ACP plugin @openclaw v26 has (you need to activate it): the full integration between your OpenClaw agent and Claude Code CLI is possible. Blows my mind. Docs: https://t.co/qJCJA7qG0R

Image hidden

@onusoz · /2026/03/01· 10:17 PM View on

pro-tip on how to keep your agent on track and make sure it follows PLANS even after multiple compactions. I don't know if this is common knowledge

if the thing you are trying to make it do will take more than 1-2 steps, always make it create a plan: an implementation plan, refactor plan, bugfix plan, debugging plan, etc.

have a conversation with the agent. crystallize the issue or feature. talk to it until there are no question marks left in your head

then make it save it somewhere: "now create an implementation plan for that in docs". it can be /tmp or docs/ in the repo. I personally use YYYY-MM-DD-x-plan.md naming. IMO all plans should be kept in the repo

then here is the critical part: you need to prompt it "now implement the plan in <filename>. if context compacts, make sure to re-read the plan and assess the current state, before continuing. finish it to completion" -> something along those lines

why? because of COMPACTION. compaction means previous context will get lossily compressed and crucial info will most likely get lost. that is why you need to pin things down before you let your agent loose on the task

compaction means the agent plays the telephone game with itself every few minutes, and most likely forgets the previous conversation except for the VERY LAST USER MESSAGE that you have given it

now, every harness might have a different approach to implementing this. but there is one thing that you can always assume to be correct, given that its developers have common sense. that is, harnesses NEVER discard the last user message (i.e. your final prompt) and make sure it is kept verbatim programmatically even after the context compacts

since the last user message is the only piece of text that is guaranteed to survive compaction, you then need to include a breadcrumb to your original plan, the md file. and you need to make it aware that it might diverge if it does not read the plan

there is good rationale for "breaking the 4th wall" for the model and making it aware of its own context compaction. IMO models should be made aware of the limitations of their context and harnesses. they should also be given tools to access and re-read pre-compaction user messages, if necessary

the important thing is to develop mechanical sympathy for these things, harness and model combined. an engineer does not have the luxury to say "oh this thing doesn't work", and instead should ask "why can't I get it to work?"

let me know if you have better workflows or tips for this. I know this can be made easier with slash commands in pi, for example, but I haven't had the chance to do that for myself yet
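
a minimal sketch of the breadcrumb idea in python, in case it helps to see it written down. the helper names (`plan_path`, `implementation_prompt`), the `docs` default and the example slug are made up for illustration and are not part of any harness. the sketch only builds a dated plan filename and the final prompt that points back to it:

```python
from datetime import date
from pathlib import Path
from typing import Optional


def plan_path(slug: str, root: Path = Path("docs"), day: Optional[date] = None) -> Path:
    """Build a plan path using the YYYY-MM-DD-x-plan.md naming (hypothetical helper)."""
    day = day or date.today()
    return root / f"{day.isoformat()}-{slug}-plan.md"


def implementation_prompt(path: Path) -> str:
    """Compose the final user message, which carries the breadcrumb to the plan file."""
    return (
        f"now implement the plan in {path.as_posix()}. "
        "if context compacts, make sure to re-read the plan and assess "
        "the current state, before continuing. finish it to completion"
    )


if __name__ == "__main__":
    # Example slug and date are placeholders, not taken from a real project.
    path = plan_path("auth-refactor", day=date(2026, 3, 1))
    print(implementation_prompt(path))
```

the point of generating the prompt from the path is that the filename always ends up in the last user message, which is the part assumed to survive compaction.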

@onusoz · /2026/03/01· 08:24 PM

testing codex in a discord thread with another CLI I've built for wikidata (gh:osolmaz/wd-cli)

it's surprising how well this works. the query was "use wd-cli to get the list of professors at middle east technical university from 1970 to 1980"

some names I recognize, and some others are surprising, like a japanese math professor who naturalized and got a turkish name :)


@onusoz · /2026/03/01· 06:50 PM

OpenClaw is already higher than Claude Code and Codex on Google Trends, this was unexpected for me


@onusoz · /2026/03/01· 03:38 PM

my blog now semi-automatically detects tweets that look like blog posts and automatically features them alongside my native jekyll blog posts. all statically generated!

I am loving this setup, because it works without a backend, and can probably scale without ever needing one

how it works:
- @kubmi's xTap scrapes all posts that I see. these include mine
- a script periodically takes my tweets and the ones I quote tweet, and syncs them to YYYY-MM-DD.jsonl files in my blog repo
- an agent skill lets codex decide whether to feature the tweet or not, and makes it generate a title for it

this could then be a daily cron job with openclaw for example, and I would just have to click merge every once in a while

and this is still pure jekyll + some python scripts for processing

I am pretty happy with how this ended up. It means I don't have to double post, and there are guarantees that my X posts will eventually make their way into my blog with minimal supervision
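
for illustration, the sync step (tweets into YYYY-MM-DD.jsonl files) could look roughly like the sketch below. this is not the actual script from the blog repo. the function name `sync_tweets` and the `id` / `created_at` fields are assumptions about what the scraped records look like:

```python
import json
from collections import defaultdict
from pathlib import Path


def sync_tweets(tweets: list[dict], out_dir: Path) -> list[Path]:
    """Append tweets to per-day YYYY-MM-DD.jsonl files, skipping ids already stored."""
    out_dir.mkdir(parents=True, exist_ok=True)

    # Group by calendar day. Assumed field: "created_at" is an ISO 8601 string,
    # so its first 10 characters are the YYYY-MM-DD date.
    by_day = defaultdict(list)
    for tweet in tweets:
        by_day[tweet["created_at"][:10]].append(tweet)

    written = []
    for day, items in sorted(by_day.items()):
        path = out_dir / f"{day}.jsonl"

        # Collect ids already in the file so that re-running the sync is a no-op.
        seen = set()
        if path.exists():
            with path.open(encoding="utf-8") as fh:
                seen = {json.loads(line)["id"] for line in fh if line.strip()}

        new_items = []
        for tweet in items:
            if tweet["id"] not in seen:
                seen.add(tweet["id"])
                new_items.append(tweet)
        if not new_items:
            continue

        # One JSON object per line, appended to the day's file.
        with path.open("a", encoding="utf-8") as fh:
            for tweet in new_items:
                fh.write(json.dumps(tweet, ensure_ascii=False) + "\n")
        written.append(path)
    return written
```

keeping the step idempotent is what would make it safe to run from a periodic job: a second run over the same tweets writes nothing, so the repo only changes when there is something new to merge.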


@onusoz · /2026/03/01· 08:48 AM

"this is the worst AI will ever be"

I'm sad, not because this is right, but because it is wrong

OpenAI's frontier coding model gpt-5.3-codex-xhigh feels a lot worse compared to before. It is sloppy and lazy, though its UX got better with messages

It feels like the gpt-5.2-codex-xhigh at the end of December was a lot more diligent and thorough, and did not make stupid mistakes like the one I posted before. might be a model or harness problem, I don't know

@sama says users tripled since the beginning of the year, so what should we expect? of course they will make infra changes that will feel like cutting corners, and I don't blame them for that

and about "people want faster codex". I do want faster codex. but I want it in a way that doesn't lower the highest baseline performance compared to the previous generation. I want the optionality to dial it down to as slow as it needs to be, to be as reliable as before

it is of course easier said than done. kudos to the codex team for not having any major incidents while taking the plane apart and putting it back together during flight. they are juggling an insane amount of complexity, and the whims of thousands of different stakeholders

my hope is that this post is taken as a canary. I am getting dumber because of the infra changes there. I have no other option because codex was really that good compared to the competition

my wish is to have detailed announcements as to what changes on openai codex infra, when it changes, so I can brace myself. we don't get notified about these changes, despite our performance and livelihoods depending on them. I have to answer to others when the tool I deemed reliable yesterday stops working today, not the tool

on another note, the performance curve of these models seems to be a rising sinusoid. crests correspond to the release of a new generation. they start with a smaller user base for testing, and it has the highest quality at this point. then it enshittifies as the model is scaled to the rest of the infra. we saw the pattern numerous times in the last 3 years across multiple companies, so I think we should accept it as an economic law
