
Conversation

@lolipopshock
Collaborator

@lolipopshock lolipopshock commented Jun 25, 2022

This PR introduces a new method, predict_page, in the VILA predictors that allows setting a maximum batch size when running the model. This can be used to control memory usage when running the VILA models.
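The idea can be illustrated with a generic sketch (a hypothetical helper, not the actual VILA implementation): split the per-page model inputs into chunks of at most max_batch_size, run the model on each chunk, and concatenate the results. Peak memory is bounded by the largest chunk, and the concatenated output is identical to a single full-batch call as long as each item is predicted independently.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")

def predict_in_batches(
    items: Sequence[T],
    predict_fn: Callable[[Sequence[T]], List[R]],
    max_batch_size: int,
) -> List[R]:
    """Run predict_fn over items in chunks of at most max_batch_size.

    Peak memory is bounded by the chunk size; the concatenated results
    match calling predict_fn on all items at once, provided predict_fn
    treats each item independently.
    """
    results: List[R] = []
    for start in range(0, len(items), max_batch_size):
        results.extend(predict_fn(items[start:start + max_batch_size]))
    return results
```

With max_batch_size set to 1 this degenerates to item-by-item prediction, which is the degenerate case exercised in the test snippet further down.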

@lolipopshock
Collaborator Author

The new batching function is tested via:

import layoutparser as lp  # For layout detection and visualization

from vila.pdftools.pdf_extractor import PDFExtractor
from vila.predictors import LayoutIndicatorPDFPredictor

# Extract tokens and rendered images for each page of the PDF
pdf_extractor = PDFExtractor("pdfplumber")
page_tokens, page_images = pdf_extractor.load_tokens_and_image("test.pdf")

vision_model = lp.EfficientDetLayoutModel("lp://PubLayNet")
pdf_predictor = LayoutIndicatorPDFPredictor.from_pretrained("allenai/ivila-block-layoutlm-finetuned-docbank")

for idx, page_token in enumerate(page_tokens):
    # Detect layout blocks with the vision model and attach them to the page tokens
    blocks = vision_model.detect(page_images[idx])
    page_token.annotate(blocks=blocks)
    pdf_data = page_token.to_pagedata().to_dict()

    # predict() and predict_page() with a batch size of 1 should produce identical results
    predicted_tokens = pdf_predictor.predict(pdf_data, page_token.page_size)
    predicted_tokens2 = pdf_predictor.predict_page(pdf_data, page_token.page_size, 1)
    assert predicted_tokens == predicted_tokens2

@lolipopshock lolipopshock merged commit 4dc60ae into master Jun 27, 2022
@lolipopshock lolipopshock deleted the control-predict-batch branch June 29, 2022 07:04