-
Couldn't load subscription status.
- Fork 56
Add ui for low quality, max threshold more lax, enhance pcm before se… #340
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Resolves #334 |
| - Preserve natural phrasing: maintain contractions and informal tone if present, unless clarity demands adjustment. | ||
| - Maintain accuracy: do not invent or omit key details like dates, names, or numbers. | ||
| - Produce clean prose: use complSmiley faceete sentences, correct punctuation, and paragraph breaks only where needed for readability. | ||
| - Produce clean prose: use complete sentences, correct punctuation, and paragraph breaks only where needed for readability. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
didn't we fix this before?
| editingPrompt: ` You are a Command-Interpreter assistant. Your job is to take a raw speech transcript-complete with hesitations, false starts, "umm"s and self-corrections-and treat it as the user issuing a high-level instruction. Instead of merely polishing their words, you must: | ||
| 1. Extract the intent: identify the action the user is asking for (e.g. "write me a GitHub issue," "draft a sorry-I-missed-our-meeting email," "produce a summary of X," etc.). | ||
| 2. Ignore disfluencies: strip out "uh," "um," false starts and filler so you see only the core command. | ||
| 3. Map to a template: choose an appropriate standard format (GitHub issue markdown template, professional email, bullet-point agenda, etc.) that matches the intent. | ||
| 4. Generate the deliverable: produce a fully-formed document in that format, filling in placeholders sensibly from any details in the transcript. | ||
| 5. Do not add new intent: if the transcript doesn't specify something (e.g. title, recipients, date), use reasonable defaults (e.g. "Untitled Issue," "To: [Recipient]") or prompt the user for the missing piece. | ||
| 6. Produce only the final document: no commentary, apologies, or side-notes-just the completed issue/email/summary/etc. | ||
| 7. Your response MUST contain ONLY the resultant text. DO NOT include: | ||
| - Any markers like [START/END CURRENT NOTES CONTENT] | ||
| - Any explanations, apologies, or additional text | ||
| - Any formatting markers like --- or \`\`\` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this to try to keep the prompt from leaking?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This was just the result of running bun generate:constants
| - Resolve corrections smoothly: when the speaker self-corrects ("let's do next week... no, next month"), choose the final phrasing. | ||
| - Preserve natural phrasing: maintain contractions and informal tone if present, unless clarity demands adjustment. | ||
| - Maintain accuracy: do not invent or omit key details like dates, names, or numbers. | ||
| - Produce clean prose: use complete sentences, correct punctuation, and paragraph breaks only where needed for readability. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh we did fix it before, we just never regenerated
| * - Applies a gentle high-pass filter (~80 Hz) | ||
| * - Peak normalizes to ~-3 dBFS with a capped gain | ||
| */ | ||
| function enhancePcm16(pcm: Buffer, sampleRate: number): Buffer { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice 😎 can we pull this into an audio utility file?
…nding to groq
After investigation, the audio silence ended up being a red herring. We have been successfully sending all audio, even shortform, to the server. The issue is the perceived quality on groqs end. Our threshold of -.55 was too strong and for short audio we ended up tripping this threshold frequently.
What's updated in this PR: