10Ala10 (Contributor) commented Oct 13, 2025

Summary

Add support for reading Microsoft PowerPoint (.pptx) files by implementing a dedicated PPTXReader class.

This follows the same pattern as the existing DocxReader, using python-pptx to extract text content from presentation slides. PPTX files are now handled by a dedicated reader instead of falling back to the generic TextReader, which would fail on them because a .pptx file is a ZIP archive of XML parts rather than plain text.
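For readers unfamiliar with python-pptx, the extraction step described above can be sketched roughly as follows. This is an illustration, not the code from this PR: the function names `extract_slide_texts` and `read_pptx` and the "Slide N:" labelling are assumptions. Only `Presentation`, `.slides`, `.shapes`, `.has_text_frame` and `.text_frame.text` come from python-pptx itself.

```python
from typing import Any, List


def extract_slide_texts(presentation: Any) -> List[str]:
    """Return one text block per slide, in slide order.

    `presentation` is expected to look like a python-pptx Presentation:
    it exposes `.slides`, each slide exposes `.shapes`, and text-bearing
    shapes expose `.has_text_frame` and `.text_frame.text`.
    """
    slide_texts: List[str] = []
    for number, slide in enumerate(presentation.slides, start=1):
        parts: List[str] = []
        for shape in slide.shapes:
            # Pictures, connectors and similar shapes carry no text frame.
            if getattr(shape, "has_text_frame", False):
                text = shape.text_frame.text.strip()
                if text:
                    parts.append(text)
        # The "Slide N:" prefix is a hypothetical labelling choice.
        slide_texts.append(f"Slide {number}:\n" + "\n".join(parts))
    return slide_texts


def read_pptx(path: str) -> List[str]:
    """Open a .pptx file with python-pptx and extract its slide text."""
    # Imported lazily so this module still loads without python-pptx installed.
    from pptx import Presentation

    return extract_slide_texts(Presentation(path))
```

Shapes without a text frame, such as pictures and tables, are skipped in this sketch, so table cell text would need separate handling.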

Key changes:

  • Added PPTXReader class in knowledge/reader/pptx_reader.py
  • Added PPTX content type to the ContentType enum
  • Updated ReaderFactory to route .pptx files to the PPTX reader
  • Supports both sync and async reading with proper slide text extraction
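The routing and sync/async points in the list above could look something like the sketch below. All names here (`READER_BY_SUFFIX`, `pick_reader`, `read_async`) are hypothetical; the actual ReaderFactory, ContentType enum and PPTXReader signatures are not shown in this conversation and may be organised differently.

```python
import asyncio
from pathlib import Path
from typing import Callable, Dict

# Hypothetical registry mapping a file suffix to a reader key.
READER_BY_SUFFIX: Dict[str, str] = {
    ".docx": "docx",
    ".pptx": "pptx",
}
# Assumed fallback, mirroring the generic text reader mentioned in the summary.
DEFAULT_READER = "text"


def pick_reader(filename: str) -> str:
    """Choose a reader key from the file extension, case-insensitively."""
    suffix = Path(filename).suffix.lower()
    return READER_BY_SUFFIX.get(suffix, DEFAULT_READER)


async def read_async(read_sync: Callable[[str], str], path: str) -> str:
    """Run a blocking reader in a worker thread so the event loop stays free."""
    return await asyncio.to_thread(read_sync, path)
```

Wrapping the synchronous reader with `asyncio.to_thread` (Python 3.9+) is one simple way to offer an async variant without duplicating the parsing logic. Whether the PR does it this way is not stated here.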

Type of change

  • [ ] Bug fix
  • [x] New feature
  • [ ] Breaking change
  • [ ] Improvement
  • [ ] Model update
  • [ ] Other:

Checklist

  • [x] Code complies with style guidelines
  • [x] Ran format/validation scripts (./scripts/format.sh and ./scripts/validate.sh)
  • [x] Self-review completed
  • [ ] Documentation updated (comments, docstrings)
  • [ ] Examples and guides: Relevant cookbook examples have been included or updated (if applicable)
  • [ ] Tested in clean environment
  • [ ] Tests added/updated (if applicable)

Additional Notes

Requires the python-pptx package: pip install python-pptx

This implementation handles only the modern .pptx format. The older .ppt format is a legacy binary format that needs specialized parsing, so supporting it would require a different approach.
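To make the .pptx versus .ppt distinction concrete: a .pptx file is a ZIP container, while a legacy .ppt file is an OLE2 compound file, and the two can be told apart by their leading bytes. The helper names below (`sniff_container`, `sniff_file`) are hypothetical and not part of this PR. This is only a sketch of how a caller might reject .ppt input early.

```python
# Leading signature of a ZIP archive (used by .pptx, .docx, .xlsx, ...).
ZIP_MAGIC = b"PK\x03\x04"
# Leading signature of an OLE2 compound file (used by .ppt, .doc, .xls, ...).
OLE2_MAGIC = bytes.fromhex("d0cf11e0a1b11ae1")


def sniff_container(header: bytes) -> str:
    """Classify a file by its first bytes: 'zip', 'ole2' or 'unknown'."""
    if header.startswith(ZIP_MAGIC):
        return "zip"
    if header.startswith(OLE2_MAGIC):
        return "ole2"
    return "unknown"


def sniff_file(path: str) -> str:
    """Read the first eight bytes of a file and classify its container."""
    with open(path, "rb") as handle:
        return sniff_container(handle.read(8))
```

A result of "zip" only says the file is some ZIP archive. Confirming that it is really a presentation still requires opening it with a PPTX-aware library.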

10Ala10 requested a review from a team as a code owner October 13, 2025 21:12
10Ala10 changed the title from "Feat/add-pptx-reader" to "feat: add pptx reader support" Oct 13, 2025
10Ala10 requested a review from kausmeows October 16, 2025 21:38
10Ala10 and others added 4 commits October 21, 2025 12:39
…ecificity

- Changed agent model to OpenAIChat with id "gpt-4o-mini" for improved performance.
- Updated file path parameter from 'file' to 'path' for consistency.
- Enhanced the query prompt to specify key topics and main points in the presentation content.
willemcdejongh merged commit e698fc3 into agno-agi:main Oct 23, 2025
5 checks passed
Mustafa-Esoofally pushed a commit that referenced this pull request Oct 23, 2025
## Summary

Add support for reading Microsoft PowerPoint (.pptx) files by
implementing a dedicated PPTXReader class.

This follows the same pattern as the existing DocxReader, using
python-pptx to extract text content from presentation slides. PPTX files
are now properly handled instead of falling back to the generic
TextReader which would fail on binary PowerPoint files.

Key changes:
- Added PPTXReader class in knowledge/reader/pptx_reader.py
- Added PPTX content type to the ContentType enum
- Updated ReaderFactory to route .pptx files to the PPTX reader
- Supports both sync and async reading with proper slide text extraction

## Type of change

- [ ] Bug fix
- [x] New feature
- [ ] Breaking change
- [ ] Improvement
- [ ] Model update
- [ ] Other:

---

## Checklist

- [x] Code complies with style guidelines
- [x] Ran format/validation scripts (`./scripts/format.sh` and
`./scripts/validate.sh`)
- [x] Self-review completed
- [ ] Documentation updated (comments, docstrings)
- [ ] Examples and guides: Relevant cookbook examples have been included
or updated (if applicable)
- [ ] Tested in clean environment
- [ ] Tests added/updated (if applicable)

---

## Additional Notes

Requires python-pptx package: `pip install python-pptx`

This implementation only handles the modern .pptx format. The older .ppt
format would require a different approach since it's a binary format
that needs specialized parsing.

---------

Co-authored-by: Kaustubh
dirkbrnd added a commit to agno-agi/agno-docs that referenced this pull request Oct 23, 2025
## Description
Add documentation for the new PPTX reader feature that allows reading
Microsoft PowerPoint (.pptx) files.

## Type of Change
- [ ] Bug fix (errors, broken links, outdated info)
- [x] New content
- [ ] Content improvement
- [ ] Other: ____

## Related Issues/PRs
- Related SDK PR: agno-agi/agno#5022, **Feat: Add PPTX Reader**

## Checklist
- [x] Content is accurate and up-to-date
- [ ] All links tested and working
- [ ] Code examples verified (if applicable)
- [x] Spelling and grammar checked
- [ ] Screenshots updated (if applicable)

---------

Co-authored-by: Dirk Brand