I made something called `ultraplan`. It's is a CLI tool that records multi-modal... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		mritchie712 12 days ago \| parent \| context \| favorite \| on: Handy – Free open source speech-to-text app I made something called `ultraplan`. It's is a CLI tool that records multi-modal context (audio transcription via local Whisper, screenshots, clipboard content, etc.) into a timeline that AI agents like Claude Code can consume. I have a claude skill `/record` that runs the CLI which starts a new recording. I debug, research, etc., then say "finito" (or choose your own stopword). It outputs a markdown file with your transcribed speech interleaved with screenshots and text that you copied. You can say other keywords like "marco" and it will take a screenshot hands-free. When the session ends, claude reads the timeline (e.g. looks at screenshots) and gets to work. I can clean it up and push to github if anyone would get use out of it.

mritchie712 12 days ago | [–]

https://github.com/definite-app/ultraplan

heliostatic 12 days ago | | [–]

Definitely interested in that!

mritchie712 12 days ago | | [–]

Added link above!

wanderingmind 12 days ago | [–]

Sounds interesting I would love to use it if you get a chance to push to github

mritchie712 12 days ago | [–]

https://github.com/definite-app/ultraplan

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact