Add ui for low quality, max threshold more lax, enhance pcm before se… #340

JohnDonavon · 2025-10-14T22:52:10Z

…nding to groq

After investigation, the audio silence turned out to be a red herring. We have been successfully sending all audio, even short-form, to the server. The issue is the perceived quality on Groq's end: our threshold of -0.55 was too strict, and short audio tripped it frequently.

What's updated in this PR:

UI updates to differentiate between audio that is too short and audio that is poor quality
Some small server-side enhancements for pcm16 audio to clean it up and improve quality
A relaxation of the low-quality threshold from -0.55 to -0.75. In local testing, transcripts for my intentionally short and poor audio still came back correctly with the new threshold.
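
To illustrate the distinction the UI now draws, here is a minimal sketch of how the two failure cases might be separated. Only the -0.75 threshold comes from this PR; the function name, the verdict labels, the 500 ms minimum duration, and the assumption that the quality score is a negative number where lower means worse are all hypothetical.

```typescript
// Hypothetical sketch, not the code in this PR.
// Only LOW_QUALITY_THRESHOLD reflects a value stated in the description.
const LOW_QUALITY_THRESHOLD = -0.75; // relaxed from -0.55 in this PR
const MIN_DURATION_MS = 500; // assumed cutoff, for illustration only

type AudioVerdict = "ok" | "too_short" | "low_quality";

// Classify a recording so the UI can show a distinct message per case.
// qualityScore is assumed to be a negative number where lower means worse.
function classifyAudio(durationMs: number, qualityScore: number): AudioVerdict {
  if (durationMs < MIN_DURATION_MS) return "too_short";
  if (qualityScore < LOW_QUALITY_THRESHOLD) return "low_quality";
  return "ok";
}
```

With the old -0.55 threshold a score of -0.6 would have been flagged as low quality; under the relaxed value it passes.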

github-actions · 2025-10-14T22:52:19Z

Resolves #334

fulltimemike · 2025-10-15T00:15:33Z

lib/constants/generated-defaults.ts

 - Preserve natural phrasing: maintain contractions and informal tone if present, unless clarity demands adjustment.
 - Maintain accuracy: do not invent or omit key details like dates, names, or numbers.
- Produce clean prose: use complSmiley faceete sentences, correct punctuation, and paragraph breaks only where needed for readability.
+- Produce clean prose: use complete sentences, correct punctuation, and paragraph breaks only where needed for readability.


didn't we fix this before?

fulltimemike · 2025-10-15T00:18:12Z

server/src/constants/generated-defaults.ts

+  editingPrompt: ` You are a Command-Interpreter assistant. Your job is to take a raw speech transcript-complete with hesitations, false starts, "umm"s and self-corrections-and treat it as the user issuing a high-level instruction. Instead of merely polishing their words, you must:
+    1.	Extract the intent: identify the action the user is asking for (e.g. "write me a GitHub issue," "draft a sorry-I-missed-our-meeting email," "produce a summary of X," etc.).
+    2.	Ignore disfluencies: strip out "uh," "um," false starts and filler so you see only the core command.
+    3.	Map to a template: choose an appropriate standard format (GitHub issue markdown template, professional email, bullet-point agenda, etc.) that matches the intent.
+    4.	Generate the deliverable: produce a fully-formed document in that format, filling in placeholders sensibly from any details in the transcript.
+    5.	Do not add new intent: if the transcript doesn't specify something (e.g. title, recipients, date), use reasonable defaults (e.g. "Untitled Issue," "To: [Recipient]") or prompt the user for the missing piece.
+    6.	Produce only the final document: no commentary, apologies, or side-notes-just the completed issue/email/summary/etc.
+    7. Your response MUST contain ONLY the resultant text. DO NOT include:
+      - Any markers like [START/END CURRENT NOTES CONTENT]
+      - Any explanations, apologies, or additional text
+      - Any formatting markers like --- or \`\`\`


Is this to try to keep the prompt from leaking?

JohnDonavon · 2025-10-15

This was just the result of running bun generate:constants.

fulltimemike · 2025-10-15T00:19:45Z

shared-constants.js

 - Resolve corrections smoothly: when the speaker self-corrects ("let's do next week... no, next month"), choose the final phrasing.
 - Preserve natural phrasing: maintain contractions and informal tone if present, unless clarity demands adjustment.
 - Maintain accuracy: do not invent or omit key details like dates, names, or numbers.
 - Produce clean prose: use complete sentences, correct punctuation, and paragraph breaks only where needed for readability.


Oh we did fix it before, we just never regenerated

fulltimemike · 2025-10-15T00:21:22Z

server/src/services/ito/itoService.ts

+ * - Applies a gentle high-pass filter (~80 Hz)
+ * - Peak normalizes to ~-3 dBFS with a capped gain
+ */
+function enhancePcm16(pcm: Buffer, sampleRate: number): Buffer {


Nice 😎 can we pull this into an audio utility file?
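
For readers without the full diff, here is a rough sketch of what a standalone utility along these lines could look like, based only on the two steps named in the doc comment (a high-pass filter around 80 Hz, then peak normalization to about -3 dBFS with capped gain). It is not the PR's implementation: the function name, the first-order filter design, the 4x gain cap, and the use of Int16Array instead of Buffer are all assumptions made to keep the example self-contained.

```typescript
// Hypothetical sketch, not the enhancePcm16 from this PR.
const HIGH_PASS_CUTOFF_HZ = 80; // "~80 Hz" per the doc comment
const TARGET_PEAK_DBFS = -3; // "~-3 dBFS" per the doc comment
const MAX_GAIN = 4; // assumed cap; the PR's actual cap is not shown

function enhancePcm16Sketch(samples: Int16Array, sampleRate: number): Int16Array {
  if (samples.length === 0 || sampleRate <= 0) return new Int16Array(samples);

  // First-order RC high-pass: y[n] = alpha * (y[n-1] + x[n] - x[n-1])
  const rc = 1 / (2 * Math.PI * HIGH_PASS_CUTOFF_HZ);
  const dt = 1 / sampleRate;
  const alpha = rc / (rc + dt);

  const filtered = new Float64Array(samples.length);
  let prevIn = samples[0];
  let prevOut = 0;
  let peak = 0;
  for (let i = 0; i < samples.length; i++) {
    const x = samples[i];
    const y = alpha * (prevOut + x - prevIn);
    filtered[i] = y;
    prevIn = x;
    prevOut = y;
    peak = Math.max(peak, Math.abs(y));
  }

  const out = new Int16Array(samples.length);
  if (peak === 0) return out; // nothing left after filtering (e.g. pure DC)

  // Scale so the peak lands at the target level, never exceeding MAX_GAIN.
  const targetPeak = 32767 * Math.pow(10, TARGET_PEAK_DBFS / 20);
  const gain = Math.min(targetPeak / peak, MAX_GAIN);
  for (let i = 0; i < filtered.length; i++) {
    const scaled = Math.round(filtered[i] * gain);
    out[i] = Math.max(-32768, Math.min(32767, scaled)); // clamp to int16
  }
  return out;
}
```

Capping the gain keeps near-silent recordings from having their noise floor amplified all the way up to the target peak.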

Add ui for low quality, max threshold more lax, enhance pcm before se…

a3705b7

…nding to groq

fix test

c993ef1

fulltimemike reviewed Oct 15, 2025

fulltimemike approved these changes Oct 15, 2025

Move enhancepcm16 into audio util

e42d9ef

JohnDonavon merged commit e92b267 into dev Oct 15, 2025
4 checks passed
