As others said this was possible for months already with llama.cpp's support for the Anthropic Messages API. You just need to set ANTHROPIC_BASE_URL. The specific llama-server settings/flags were a pain to figure out and required some hunting, so I collected them in this guide to using CC with local models:
One tricky thing that took me a whole day to figure out is that using Claude Code in this setup was causing total network failures due to telemetry pings, so I had to set this env var to 1: CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC
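For anyone setting this up, the two env vars together look something like this (the host/port is an assumption — point it at wherever your llama-server is actually listening; 8080 is just its default):

```shell
# Point Claude Code at a local llama-server speaking the Anthropic Messages API.
export ANTHROPIC_BASE_URL="http://127.0.0.1:8080"   # assumed host/port
# Stop telemetry pings from causing network failures on a local-only setup.
export CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1
# then launch as usual:
# claude
```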
Curious how it compares to last week’s release of Kyutai’s Pocket-TTS [1] which is just 100M params, and excellent in both speed and quality (English only). I use it in my voice plugin [2] for quick voice updates in Claude Code.
You are absolutely right — most internet users don't know the specific keyboard combination to make an em dash and substitute it with two hyphens. On some websites it is automatically converted into an em dash. If you would like to know more about this important punctuation symbol and its significance in identifying AI writing, please let me know.
Thanks for that. I had no idea either. I'm genuinely surprised Windows buries such a crucial thing like this. Or why they even bothered adding it in the first place when it's so complicated.
The Windows version is an escape hatch for keying in any arbitrary character code, hence why it's so convoluted. You need to know which code you're after.
To be fair, the alt-input is a generalized system for inputting Unicode characters outside the set keyboard layout. So it's not like they added this input specifically. Still, the em dash really should have an easier input method given how crucial a symbol it is.
It's a generalized system for entering code page glyphs that was extended to support Unicode. 0150 and 0151 only work if you are on CP1252 as those aren't the Unicode code points.
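You can see the CP1252-vs-Unicode distinction directly: decimal 150/151 (hex 0x96/0x97) are the en/em dash only in Windows-1252, while the actual Unicode code points are U+2013/U+2014 (decimal 8211/8212). A quick check with iconv:

```shell
# Bytes 0x96/0x97 (decimal 150/151): en and em dash in Windows-1252,
# but NOT the Unicode code points (those are U+2013/U+2014).
printf '\x96\x97' | iconv -f CP1252 -t UTF-8
# prints: –—
```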
And the em dash is trivially easy on iOS — you simply long-press the regular hyphen key. I've been using it for years and am not stopping just because people might suddenly accuse me of being an AI.
Context filling up is sort of the Achilles heel of CLI agents. The main remedy is to have it output some type of handoff document and then run /compact which leaves you with a summary of the latest task. It sort of works but by definition it loses information, and you often find yourself having to re-explain or re-generate details to continue the work.
I made a tool[1] that lets you just start a new session and injects the original session file path, so you can extract any arbitrary details of prior work from there using sub-agents.
Yes, you can literally just ask Claude Code to create a status line showing context usage. I had it make a colored progress bar of context usage, changing through green, yellow, orange, and red as context fills up. Instructions to install:
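The core of such a status line is just thresholding a usage percentage into a color; the wiring into Claude Code's statusLine setting (a command fed session JSON on stdin) is left out here, and the cutoffs below are made up:

```shell
# Map a context-usage percent to a bar color (thresholds are arbitrary;
# obtaining the real percentage from the statusline input is omitted).
bar_color() {
  if   [ "$1" -lt 50 ]; then echo green
  elif [ "$1" -lt 75 ]; then echo yellow
  elif [ "$1" -lt 90 ]; then echo orange
  else                       echo red
  fi
}
```

e.g. `bar_color 82` prints `orange`.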
Specifically for coding agents, one issue is how to continue work when you've almost filled the context window.
Compaction always loses information, so I use an alternative approach that works extremely well, based on this almost silly idea — your original session file itself is the golden source of truth with all details, so why not directly leverage it?
So I built the aichat feature in my claude-code-tools repo with exactly this in mind; the aichat rollover option puts you in a fresh session, with the original session path injected, and you use sub-agents to recover any arbitrary detail at any time. Now I keep auto-compact turned off and never compact.
It’s a relatively simple idea: no elaborate “memory” artifacts, no discipline or system to follow; just work until 95%+ context usage.
The tool (with the related plugins) makes it seamless: first type “>resume” in your session (this copies session id to clipboard), then quit and run
aichat resume <pasted session id>
And this launches a TUI offering a few ways to resume your work, one of which is “rollover”; this puts you in a new session with the original session jsonl path injected.
And in the new session say something like,
“There is a chat session log file path shown to you; Use subagents strategically to extract details of the task we were working on at the end of it”, or use the /recover-context slash command. If it doesn’t quite get all of it, prompt it again for specific details.
There’s also an aichat search command for fast Rust/tantivy-based full-text search across sessions, with a TUI for humans and a CLI/JSON mode for agents/sub-agents. The latter (and the corresponding skill and sub-agent) can be used to recover arbitrarily detailed context about past work.
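As a toy illustration of the recovery step (the real tool uses tantivy full-text search; this is just grep over the session .jsonl, and the transcript format is hypothetical):

```shell
# Given a session .jsonl and a keyword, print matching line numbers so a
# sub-agent can zoom in on those turns. Purely illustrative.
find_mentions() {
  grep -n "$2" "$1" | cut -d: -f1
}
```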
Besides being trash as others said, there’s a trade-off with real-time word-by-word transcription: there’s no opportunity for an AI to holistically correct/clean up the transcription.
You mean, after displaying each word as it is spoken, OSX then goes back and fixes what’s been displayed? I think I’ve seen it fix one or two recent words, but I guess you’re saying it could fix the entire sentence as well. I didn’t know that.
I’ve tried several, including this one, and I’ve settled on VoiceInk (local, one-time payment), and with Parakeet V3 it’s stunningly fast (near-instant) and accurate enough to talk to LLMs/code-agents, in the sense that the slight drop in accuracy relative to Whisper Turbo3 is immaterial since they can “read between the lines” anyway.
My regular cycle is to talk informally to the CLI agent and ask it to “say back to me what you understood”, and it almost always produces a nice clean and clear version. This simultaneously works as confirmation of its understanding and also as a sort of spec which likely helps keep the agent on track.
UPDATE - just tried handy with Parakeet v3, and it works really well too, so I'll use this instead of VoiceInk for a few days. I just also discovered that turning on the "debug" UI with Cmd-shift-D shows additional options like post processing and appending trailing space.
I'll bet you could take a relatively tiny model and get it to translate the transcribed "git force push" or "git push dash dash force" into "git push --force".
Likewise "cd home slash projects" into "cd ~/projects".
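Even before reaching for a model, a few fixed rewrite rules get you surprisingly far; a sketch (the rules and phrasings here are made up):

```shell
# Toy rule-based normalizer for spoken shell commands; a small model
# would generalize better, but fixed phrases are easy to rewrite.
normalize() {
  printf '%s' "$1" | sed \
    -e 's/ dash dash / --/g' \
    -e 's/force push/push --force/' \
    -e 's/ slash /\//g' \
    -e 's/cd home/cd ~/'
}
```

e.g. `normalize "git push dash dash force"` gives `git push --force`, and `normalize "cd home slash projects"` gives `cd ~/projects`.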
https://github.com/pchalasani/claude-code-tools/blob/main/do...