pdfreader.types submodule

class pdfreader.types.objects.DictBasedObject(doc, *args, **kwargs)

Dictionary-based object. Automatically resolves indirect references on attribute and item access.

class pdfreader.types.objects.StreamBasedObject(doc, stream)

Stream-based object. Automatically resolves indirect references on attribute access.

class pdfreader.types.objects.ArrayBasedObject(doc, lst)

Array-based object. Automatically resolves indirect references on item access.
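The automatic resolution described for the base classes above can be pictured with a small stand-in. This is a minimal sketch of the idea only, not pdfreader's implementation: `Ref`, `ToyDoc`, and `ToyDict` are hypothetical names invented for illustration, and the real classes may resolve references quite differently.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Ref:
    """Hypothetical stand-in for an indirect reference such as `12 0 R`."""
    num: int
    gen: int = 0


class ToyDoc:
    """Hypothetical document: maps (object number, generation) to objects."""

    def __init__(self, objects):
        self.objects = objects

    def resolve(self, value):
        # Follow chains of references until a direct object is reached.
        while isinstance(value, Ref):
            value = self.objects[(value.num, value.gen)]
        return value


class ToyDict:
    """Dictionary wrapper that resolves references on item/attribute access."""

    def __init__(self, doc, data):
        self._doc = doc
        self._data = data

    def __getitem__(self, key):
        return self._doc.resolve(self._data[key])

    def __getattr__(self, name):
        # Only called when normal attribute lookup fails.
        if name.startswith("_"):
            raise AttributeError(name)
        try:
            return self[name]
        except KeyError:
            raise AttributeError(name) from None


doc = ToyDoc({(12, 0): {"Type": "Pages", "Count": 3}})
catalog = ToyDict(doc, {"Type": "Catalog", "Pages": Ref(12)})

# The caller never sees the Ref: access returns the referenced object.
pages = catalog.Pages
```

In this toy model the caller reads `catalog.Pages` or `catalog["Pages"]` and receives the referenced dictionary directly, which is the convenience the descriptions above refer to.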

class pdfreader.types.objects.Catalog(doc, *args, **kwargs)

Dictionary-based object (Type = Catalog). See PDF 1.7 specification, sec. 7.7.2 - Document Catalog.

class pdfreader.types.objects.PageTreeNode(doc, *args, **kwargs)

Dictionary-based object (Type = Pages). See PDF 1.7 specification, sec. 7.7.3.2 - Page Tree Nodes.

pages(node=None)

Yields the tree node's pages one by one.

Returns: Page generator.

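A page tree nests intermediate nodes (Type = Pages, with a Kids array) above leaf pages (Type = Page), so yielding pages one by one amounts to a depth-first walk. The following is a minimal sketch over plain dictionaries, not pdfreader's own code: `iter_pages` is a hypothetical helper, and the `Label` key is an illustrative marker rather than a real page attribute.

```python
def iter_pages(node):
    """Yield the leaf pages of a toy page tree in document order."""
    if node["Type"] == "Page":
        yield node
        return
    # Intermediate node: descend into each kid, left to right.
    for kid in node["Kids"]:
        yield from iter_pages(kid)


# Toy tree; "Label" is a made-up key used only to identify pages here.
tree = {
    "Type": "Pages",
    "Count": 3,
    "Kids": [
        {"Type": "Page", "Label": "p1"},
        {
            "Type": "Pages",
            "Count": 2,
            "Kids": [
                {"Type": "Page", "Label": "p2"},
                {"Type": "Page", "Label": "p3"},
            ],
        },
    ],
}

labels = [page["Label"] for page in iter_pages(tree)]
```

Because the walk is a generator, pages are produced lazily and a caller can stop after the first few without visiting the rest of the tree.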
class pdfreader.types.objects.Page(doc, *args, **kwargs)

Dictionary-based Page object (Type = Page). See PDF 1.7 specification, sec. 7.7.3.3 - Page Objects.

class pdfreader.types.objects.Image(doc, stream)

Stream-based XObject (Type = XObject, Subtype = Image). See PDF 1.7 specification, sec. 8.9 - Images.

to_Pillow()

Converts the image into a PIL.Image object.

Returns: PIL.Image instance

class pdfreader.types.objects.Form(doc, stream)

Stream-based XObject (Type = XObject, Subtype = Form). See PDF 1.7 specification, sec. 8.10 - Form XObjects.

class pdfreader.types.objects.XObject(doc, stream)

Stream-based XObject (Type = XObject). See PDF 1.7 specification, sec. 8.8 - External Objects.

class pdfreader.types.content.InlineImage(entries, data)

Content of the BI/ID/EI operators.

An inline image looks like a stream-based object, but it is not one. It follows the Stream interface only so that an InlineImage can be handled the same way as an XObject/Image.

dictionary

key-value image properties

data

bytes, encoded image stream

to_Pillow()

Converts the image into a PIL.Image object.

Returns: PIL.Image instance

class pdfreader.types.content.Operator(name, args)

Page content stream operator. For example: /F01 12 Tf

name

operator name

args

list of operands
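In a content stream the operands come first and the operator name last, so `/F01 12 Tf` has the name `Tf` and two operands. The sketch below splits such a line into a name and an argument list. It is an illustration under stated assumptions, not pdfreader's parser: `ToyOperator` and `parse_simple_operator` are hypothetical, only name and number operands are handled, and keeping the leading slash on names is a choice made here, not a claim about how the library stores them.

```python
from collections import namedtuple

# Hypothetical stand-in mirroring the two documented attributes.
ToyOperator = namedtuple("ToyOperator", ["name", "args"])


def parse_simple_operator(line):
    """Split one whitespace-separated operator line into (name, args).

    Handles only name and number operands; strings, arrays and
    dictionaries found in real content streams are out of scope.
    """
    # The last token is the operator; everything before it is an operand.
    *operands, name = line.split()
    args = []
    for token in operands:
        if token.startswith("/"):
            args.append(token)         # PDF name, kept as text with its slash
        elif "." in token:
            args.append(float(token))  # real number
        else:
            args.append(int(token))    # integer
    return ToyOperator(name=name, args=args)


op = parse_simple_operator("/F01 12 Tf")
```

Here `op.name` is the operator and `op.args` holds the font resource name followed by the font size, in the order they appear in the stream.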