docs: Link to security policy in the README #1

Merged
MthwRobinson merged 1 commit into main from core-149/security-policy
Sep 27, 2022

Conversation

@MthwRobinson

Summary

Adds an explicit link to the security policy in the README for the repo.

@MthwRobinson MthwRobinson requested a review from qued September 27, 2022 14:20

@qued qued left a comment

LGTM, link works!

@MthwRobinson MthwRobinson merged commit e290f08 into main Sep 27, 2022
@MthwRobinson MthwRobinson deleted the core-149/security-policy branch September 27, 2022 14:32
yuming-long pushed a commit that referenced this pull request Sep 29, 2022
* added makefile and poetry dependency file

* added initial pdf_to_text script

* added initial readme

* smaller heading
scanny added a commit that referenced this pull request Jul 22, 2024
Disable strategy #2 (guessed MIME-type) and strategy #3 (filename
extension) and verify that a caller that asserts the correct MIME-type
as the `content_type` argument gets the right `FileType` member.
scanny added a commit that referenced this pull request Jul 22, 2024
Add a parameterized test for strategy #2 (guessed MIME-type) that
disables strategies #1 and #3 so we know what's actually being tested.
Remove individual tests made redundant by the single parameterized test.
scanny added a commit that referenced this pull request Jul 23, 2024
Disable strategy #2 (guessed MIME-type) and strategy #3 (filename
extension) and verify that a caller that asserts the correct MIME-type
as the `content_type` argument gets the right `FileType` member.
scanny added a commit that referenced this pull request Jul 23, 2024
Add a parameterized test for strategy #2 (guessed MIME-type) that
disables strategies #1 and #3 so we know what's actually being tested.
Remove individual tests made redundant by the single parameterized test.
github-merge-queue bot pushed a commit that referenced this pull request Feb 27, 2026
…#4266)

## Problem

`_patch_current_chars_with_render_mode` is called on every
`do_TJ`/`do_Tj` text operator during PDF parsing. The original
implementation re-scans the entire `cur_item._objs` list each time,
checking `hasattr(item, "rendermode")` to skip already-patched items.
For a page with N characters across M text operations, this is O(N*M) —
effectively quadratic.

Memray profiling showed this function as the #1 allocator: 17.57 GB
total across 549M allocations in a session processing just 4 files.

## Fix

Track the last-patched index so each call only processes newly-added
`LTChar` objects. Reset automatically when `cur_item` changes (new page
or figure).

**Before:** O(N²) per page — re-scans all accumulated objects on every
text operator
**After:** O(N) per page — each object visited exactly once
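
The last-patched-index idea described above can be sketched roughly as follows. This is an illustrative sketch only, not the code from the referenced commit: `Char`, `Container`, and `RenderModePatcher` are hypothetical stand-ins (for `LTChar`, `cur_item`, and the patching function respectively), and the `visits` counter exists only to make the linear cost visible.

```python
from dataclasses import dataclass, field


@dataclass
class Char:
    """Hypothetical stand-in for an LTChar object."""
    text: str


@dataclass
class Container:
    """Hypothetical stand-in for cur_item; only the _objs list matters here."""
    _objs: list = field(default_factory=list)


class RenderModePatcher:
    """Patches only the objects appended since the previous call."""

    def __init__(self):
        self._last_item = None  # container seen on the previous call
        self._next_index = 0    # first index in _objs that is not yet patched
        self.visits = 0         # instrumentation: how many objects were examined

    def patch(self, cur_item, render_mode):
        # A different container (new page or figure) means the saved index
        # no longer applies, so start again from the beginning.
        if cur_item is not self._last_item:
            self._last_item = cur_item
            self._next_index = 0

        objs = cur_item._objs
        # Walk only the tail that was added since the last call.
        for i in range(self._next_index, len(objs)):
            self.visits += 1
            if isinstance(objs[i], Char):
                objs[i].rendermode = render_mode
        self._next_index = len(objs)


# Usage: three text operators, each adding two characters to the same page.
page = Container()
patcher = RenderModePatcher()
for mode in (0, 3, 0):
    page._objs.extend([Char("a"), Char("b")])
    patcher.patch(page, mode)
```

In this sketch each character is examined once, when the text operator that produced it is handled, so the total work grows with the number of characters rather than with characters times operators. The `hasattr(item, "rendermode")` check from the original approach is no longer needed because already-patched objects are never revisited.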

---------

Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
Co-authored-by: Alan Bertl <[email protected]>