Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Conversation

@JohnDonavon
Copy link
Collaborator

@JohnDonavon JohnDonavon commented Oct 14, 2025

…nding to groq

After investigation, the audio silence ended up being a red herring. We have been successfully sending all audio, even shortform, to the server. The issue is the perceived quality on groqs end. Our threshold of -.55 was too strong and for short audio we ended up tripping this threshold frequently.

What's updated in this PR:

  • UI updates to differentiate between audio too short and audio quality poor
  • Some small audio enhancements server-side for pcm16 audio to try and clean it up to increase quality
  • A weakening of the low quality threshold from -.55 to -.75. Testing locally the transcripts for my intentionally short and poor audios still returned properly with this new threshold.

@github-actions
Copy link

Resolves #334

- Preserve natural phrasing: maintain contractions and informal tone if present, unless clarity demands adjustment.
- Maintain accuracy: do not invent or omit key details like dates, names, or numbers.
- Produce clean prose: use complSmiley faceete sentences, correct punctuation, and paragraph breaks only where needed for readability.
- Produce clean prose: use complete sentences, correct punctuation, and paragraph breaks only where needed for readability.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

didn't we fix this before?

Comment on lines +38 to +48
editingPrompt: ` You are a Command-Interpreter assistant. Your job is to take a raw speech transcript-complete with hesitations, false starts, "umm"s and self-corrections-and treat it as the user issuing a high-level instruction. Instead of merely polishing their words, you must:
1. Extract the intent: identify the action the user is asking for (e.g. "write me a GitHub issue," "draft a sorry-I-missed-our-meeting email," "produce a summary of X," etc.).
2. Ignore disfluencies: strip out "uh," "um," false starts and filler so you see only the core command.
3. Map to a template: choose an appropriate standard format (GitHub issue markdown template, professional email, bullet-point agenda, etc.) that matches the intent.
4. Generate the deliverable: produce a fully-formed document in that format, filling in placeholders sensibly from any details in the transcript.
5. Do not add new intent: if the transcript doesn't specify something (e.g. title, recipients, date), use reasonable defaults (e.g. "Untitled Issue," "To: [Recipient]") or prompt the user for the missing piece.
6. Produce only the final document: no commentary, apologies, or side-notes-just the completed issue/email/summary/etc.
7. Your response MUST contain ONLY the resultant text. DO NOT include:
- Any markers like [START/END CURRENT NOTES CONTENT]
- Any explanations, apologies, or additional text
- Any formatting markers like --- or \`\`\`
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this to try to keep the prompt from leaking?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This was just the result of running bun generate:constants

- Resolve corrections smoothly: when the speaker self-corrects ("let's do next week... no, next month"), choose the final phrasing.
- Preserve natural phrasing: maintain contractions and informal tone if present, unless clarity demands adjustment.
- Maintain accuracy: do not invent or omit key details like dates, names, or numbers.
- Produce clean prose: use complete sentences, correct punctuation, and paragraph breaks only where needed for readability.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh we did fix it before, we just never regenerated

* - Applies a gentle high-pass filter (~80 Hz)
* - Peak normalizes to ~-3 dBFS with a capped gain
*/
function enhancePcm16(pcm: Buffer, sampleRate: number): Buffer {
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice 😎 can we pull this into an audio utility file?

@JohnDonavon JohnDonavon merged commit e92b267 into dev Oct 15, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants