More

dimitri-vs · 2026-03-01T16:10:55 1772381455

I think they already are. When I used the prompt with 5.2 it gives very concise and general info but if you use older models (5.1 instant or o3) you get a ton of detail.

Panoramix · 2026-03-01T19:30:29 1772393429

I just tried 5.1 and got the exact same output as for 5.2 (actually I got slightly less info with 5.1)

joquarky · 2026-03-01T20:42:31 1772397751

Measuring the behavior of non-deterministic systems requires more than one sample.

dimitri-vs · 2026-03-01T01:09:18 1772327358

I've tried a bunch of them only to settle on using Claude Code with remote control.

dimitri-vs · 2026-02-28T15:16:37 1772291797

As others have said: accountability

dimitri-vs · 2026-02-23T02:38:22 1771814302

What's your budget? https://en.tokyodevices.com/items/128

But seriously you can probably DIY something a lot cheaper.

dimitri-vs · 2026-02-22T19:52:04 1771789924

It's for people that don't know how or don't want to be bothered with setting up a messenger integration and a scheduler.

dimitri-vs · 2026-02-22T19:49:52 1771789792

How would it know you've ran out of milk?

stavros · 2026-02-22T19:57:33 1771790253

I told it when I noticed. I made a little pendant with a mic I can speak into and it goes to the bot.

imiric · 2026-02-22T21:03:34 1771794214

Turns out Humane was ahead of its time.

LeafItAlone · 2026-02-22T20:00:44 1771790444

I would love to hear more about this!

stavros · 2026-02-22T20:15:18 1771791318

I haven't written it up yet but the repo is here:

It's just a MEMS mic, a battery, and an ESP32, very simple but it works amazingly well. I wrote a companion Android app for it and it works extremely reliably!

Barbing · 2026-02-24T19:40:11 1771962011

I really love that. Can't wait for your writeup!

stavros · 2026-02-24T21:03:59 1771967039

The pendant is almost ready, I'll write it up this week!

Sneak peek: https://imgz.org/i6xDDz6x/

Barbing · 2026-02-26T04:52:19 1772081539

Wrong picture, it’s too small! ;) :D

Thank you, must make one!

stavros · 2026-02-26T11:56:50 1772107010

I'm going to make it 40% smaller when the small battery arrives! I really have to write the article, but I've been working on my bot all day, which is becoming extremely amazing.

Barbing · 2026-03-01T07:00:14 1772348414

Wow :D puny!

Your projects are amazing, saw your site a couple days ago and just saw your submissions now, love it and thanks!

stavros · 2026-03-01T10:19:48 1772360388

Thanks, I'm glad you like them!

liminal-dev · 2026-02-22T21:45:24 1771796724

Are you running NanoClaw or a different project?

rekmarks · 2026-02-23T03:53:34 1771818814

He's running his own thing: https://github.com/skorokithakis/stavrobot

stavros · 2026-02-24T21:04:31 1771967071

Yep, I'm running my own thing (link in sibling), I wanted something secure I could run on my PC.

dimitri-vs · 2026-02-18T02:28:00 1771381680

IMO Copilot was "we need to give these people rope, but not enough for them to hang themselves". A non technical person with no patience and access to a real AI agent inside a business is a bull in a china shop. Copilot Cowork is the closest thing we have to what Copilot should have been and is only possible now because models finally got good enough to be less supervised.

FWIW Gemini inside Google apps is just as bad.

dimitri-vs · 2026-02-16T23:33:46 1771284826

I don't think LLMs are very good at introspection on what they know or don't know, but otherwise this is gold. Thanks for sharing.

dimitri-vs · 2026-02-16T03:56:05 1771214165

API Opus 4.6 will tell you it's still 2025, admit it's wrong then revert back to being convinced it's 2025 as it nears it's context limit.

I'll go so far as to say LLM agents are AGI-lite but saying we "just need the orchestration layer" is like saying ok we have a couple neurons, now we just need the rest of the human.

ryanSrich · 2026-02-16T04:02:28 1771214548

Giving opus a memory or real-time access to the current year is trivial. I don't see how that's an argument against it being AGI.

dimitri-vs · 2026-02-16T03:36:02 1771212962

Manual orchestration is a brittle crutch IMO - you don't get to the moon by using longer and longer ladders. A powerful model in theory should be able to self orchestrate with basic tools and environment. The thing is that it also might be as expensive as a human to run - from a tokens AND liability perspective.