-
Notifications
You must be signed in to change notification settings - Fork 334
docs: Add audio transcription example card #5020
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Greptile Summary
This PR adds a new audio transcription example card to the documentation index page (docs/examples/index.md). The change introduces a new showcase card featuring "Audio Transcription with Whisper" that demonstrates Daft's capabilities for converting audio to text at scale. The new card follows the same HTML structure and styling pattern as existing example cards on the page.
The PR also includes several visual refinements to improve the overall layout consistency. These include reducing the card height from 400px to 380px, adjusting the overlay gradient for better visual balance, and making subtle font size adjustments. These styling changes appear to be made to accommodate the new card while maintaining a cohesive visual design across all example cards.
This addition aligns well with Daft's focus on data processing and AI/ML workloads, extending the showcase of capabilities beyond existing examples like document processing, text embeddings, and image generation. The audio transcription example represents another practical use case for Daft's scalable data processing framework.
Confidence score: 2/5
- This PR has significant risk due to potential broken links and missing assets
- Score lowered because the referenced audio transcription page and cover image likely don't exist yet
- Pay close attention to the new card's link target and image reference
1 file reviewed, no comments
universalmind303
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @desmondcheongzx !
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #5020 +/- ##
==========================================
+ Coverage 75.92% 76.40% +0.48%
==========================================
Files 938 938
Lines 129158 129256 +98
==========================================
+ Hits 98064 98761 +697
+ Misses 31094 30495 -599 🚀 New features to boost your workflow:
|
* chore: Add luban build script * docs: Add docs for custom data sources and sinks (Eventual-Inc#5115) * perf(flotilla): Use Worker Affinity with Pre-Shuffle Merge (Eventual-Inc#5112) * feat(embed_text): Support LM Studio as a provider (Eventual-Inc#5103) * fix: Add nulls in json reads if a line doesn't contain the field from the schema (Eventual-Inc#4993) * chore: Remove docs codeowners (Eventual-Inc#5111) * feat: Implement embed_image() (Eventual-Inc#5101) * docs: add dark mode support for Algolia DocSearch (Eventual-Inc#5109) * docs: add noindex tag to non-stable pages (Eventual-Inc#5105) * docs: Add text guide (Eventual-Inc#5102) * ci: Do not skip postmerge tests (Eventual-Inc#5096) * docs: Improve installation instructions (Eventual-Inc#5094) * docs: More fixes to the overview page in light mode (Eventual-Inc#5095) * chore: Clean up write_turbopuffer guide (Eventual-Inc#5093) * fix: Check if UDFs are Serializable (Eventual-Inc#5091) * docs: Document write_turbopuffer in the user guide (Eventual-Inc#5092) * feat!: revert daft.func behavior on literal arguments (Eventual-Inc#5087) * perf: Split UDFs from Filters (Eventual-Inc#5070) * fix: nightly property test (Eventual-Inc#5076) * fix: Handle Unserializable Errors in Process UDFs (Eventual-Inc#5075) * revert!: "revert: Temporarily revert "Remove deprecated APIs for 0.6" (Eventual-Inc#5084) * perf(embed_text): Let Sentence Transformers select the best available device (Eventual-Inc#5082) * feat: Automatically grab embedding dimensions for sentence transformers (Eventual-Inc#5078) * feat: add mcap datasource reader (Eventual-Inc#4727) * fix: Implement Multi-Column Aggregations with List-like columns (Eventual-Inc#5017) * fix: Fix venv command for windows build (Eventual-Inc#5073) * fix: add setuptools_scm to build wheel requirements (Eventual-Inc#5072) * fix: Use cachebusting and range request fallback for HTTP requests to Hugging Face CDNs (Eventual-Inc#5061) * fix: Use async for starting and calling udf actors in flotilla (Eventual-Inc#5000) * fix: Always refresh tqdm when updating total (Eventual-Inc#5033) * feat: Add uv.lock to git (Eventual-Inc#5065) * revert: Temporarily revert "Remove deprecated APIs for 0.6" (Eventual-Inc#5068) * docs: Make overview page legible for light mode (Eventual-Inc#5067) * docs: Move custom python code higher up in docs (Eventual-Inc#5064) * fix: Fix docs build (Eventual-Inc#5066) * docs: Add better description in overview page (Eventual-Inc#5063) * docs: remove core_concepts.md and broken anchor link references (Eventual-Inc#5062) * ci: Don't run pr test suite on non-code changes fr (Eventual-Inc#5057) * fix: require uv as prerequisite for development setup (Eventual-Inc#5059) * feat: Add Hash Function Support for Decimal128, Time, Timestamp, Timestamptz Datatypes (Eventual-Inc#5026) * docs: fix formatting (Eventual-Inc#4994) * chore: Remove deprecated APIs for 0.6 (Eventual-Inc#5050) * fix: Add missing source command in Makefile install-docs-deps target (Eventual-Inc#5060) * feat: pushdown for lance scan (Eventual-Inc#4710) * feat: add lance merge_column task (Eventual-Inc#5008) * fix: Mermaid syntax error when enable explain analyze for Native Runner (Eventual-Inc#5052) * fix: clean notebook output before running tests & tweak doc proc notebook (Eventual-Inc#5055) * docs: remove runllm widget (Eventual-Inc#5056) * chore: disable hugging face library progress bars (Eventual-Inc#5040) * fix: correct Modin query optimizer value in comparison tables (Eventual-Inc#4983) * feat: Make the max parallel of scan tasks configurable for Native Runner (Eventual-Inc#5018) * chore: relax assertion in flaky sharding distribution test (Eventual-Inc#5053) * chore(dev): use pyproject.toml to manage the dev dependencies (Eventual-Inc#4849) * chore: random the counter during creating DistributedActorPoolProject… (Eventual-Inc#5039) * fix: skip credentialed tests if not from main (Eventual-Inc#5048) * feat: basic generator udf (Eventual-Inc#5036) * docs: add reo script to docs (Eventual-Inc#5049) * fix: subprocess UDF inherits current process env (Eventual-Inc#5047) * fix: sql/spark read_iceberg and read_deltalake (Eventual-Inc#5035) * refactor!: use struct datatype as daft representation of tuples (Eventual-Inc#5030) * feat: implements an openai provider with embed_text (Eventual-Inc#4997) * fix(blc): Disabled pipefail (Eventual-Inc#5031) * docs: fix broken UDF link due to core_concepts.md redirect (Eventual-Inc#5022) * docs: fix typo "Github" --> "GitHub" (Eventual-Inc#5025) * feat: daft.File object store support (Eventual-Inc#5002) * docs: fix `df.limit` link in quickstart.md (Eventual-Inc#5013) * docs: Add audio transcription example card (Eventual-Inc#5020) * feat: Propagate morsel size top-down in swordfish (Eventual-Inc#4894) * fix(blc): Attempt to fix the broken link checker. (Eventual-Inc#5010) * docs: improve audio transcription example (Eventual-Inc#4990) * docs: Spice up the examples page (Eventual-Inc#5019) * feat: DataFrame.write_huggingface (Eventual-Inc#5015) * chore: Only test MacOS on Python 3.11 in CI (Eventual-Inc#5014) * fix: Print UDF stdout and Daft logs above the progress bar (Eventual-Inc#4861) * feat: support count(1) in dataframe and choose the cheap column (Eventual-Inc#4977) * fix: Can translate sort in flotilla (Eventual-Inc#5005) * feat: add clickhouse data sink (Eventual-Inc#4850) * fix: Lazily import pil in infer dtype (Eventual-Inc#5004) * feat: implement distributed sort in flotilla engine (Eventual-Inc#4991) * chore: Snapshot Testing Optimizations (Eventual-Inc#4995) * feat!: RowWiseUdf.eval for eager evaluation (Eventual-Inc#4998) * feat: basic read_huggingface functionality (Eventual-Inc#4996) * feat: support using max() and min() on list of boolean values (Eventual-Inc#4989) * fix: Lazily import pyarrow when importing daft (Eventual-Inc#4999) * fix: lance schema does not work (Eventual-Inc#4940) * ci: Don't run pr test suite on non-code changes (Eventual-Inc#4992) * refactor: make DaftExtension class definition static (Eventual-Inc#4968) * feat: Flotilla pre-shuffle merge (Eventual-Inc#4873) * docs: fix grammar in CONTRIBUTING.md setup instructions (Eventual-Inc#4986) * fix: correct possessive apostrophe typo in README (Eventual-Inc#4984) * fix: correct GitHub capitalization and add missing period in README (Eventual-Inc#4985) * ci: No progress bar in CI (Eventual-Inc#4988) * feat: Flotilla into partitions (Eventual-Inc#4963) * fix: ignore NotFound error of the non-first list during iter dir (Eventual-Inc#4891) * feat(optimizer): Add Lance count() pushdown optimization (Eventual-Inc#4969) * feat: adds video frame streaming source (Eventual-Inc#4979) * feat: Add offset support to Spark Connect (Eventual-Inc#4962) * feat: new `daft.File` datatype (Eventual-Inc#4959) * fix: S3 multipart upload redirect to correct region (Eventual-Inc#4865) * feat: unify all Daft type to Python type conversions (Eventual-Inc#4972) See merge request: !1
Changes Made
The spiciness continues