summaryrefslogtreecommitdiff
path: root/Userland/Libraries/LibPDF
AgeCommit message (Collapse)Author
2021-06-12LibPDF: Convert to east-const to comply with the recent style changesMatthew Olsson
2021-06-12LibPDF: Bake the flipped y-axis directly into the CTM matrixMatthew Olsson
2021-06-12LibPDF: Avoid calculating rendering matrix for every glyphMatthew Olsson
2021-06-12LibPDF: Handle the TJ graphical operatorMatthew Olsson
2021-06-12LibPDF: Handle the gs graphical operatorMatthew Olsson
2021-06-12LibPDF: Add support for the CalRGB ColorSpaceMatthew Olsson
This isn't tested all that well, as the PDF I am testing with only uses it for black (which is trivial). It can be tested further when LibPDF is able to process more complex PDFs that actually use this color space non-trivially.
2021-06-12LibPDF: Split ColorSpace into a different class for each color spaceMatthew Olsson
While unnecessary at the moment, this will allow for more fine-grained control when complex color spaces get added.
2021-06-12LibPDF: Parse hint tablesMatthew Olsson
This code isn't _actually_ used as of right now, but I wrote it at the same time as all of the code in the previous commit. I realized after I wrote it that these hint tables aren't super useful if the parser already has access to the full file. However, this will be useful if we ever want to stream PDFs from the web (and possibly view them in the browser).
2021-06-12LibPDF: Parse linearized PDF filesMatthew Olsson
This is a big step, as most PDFs which are downloaded online will be linearized. Pretty much the only difference is that the xref structure is slightly different.
2021-06-12LibPDF: Fix two parser bugsMatthew Olsson
- A newline was assumed to follow the "stream" keyword, when it can also be a windows-style line break - Fix not consuming the "endobj" at the end of every indirect object
2021-06-12LibPDF: Refine the distinction between the Document and ParserMatthew Olsson
The Parser should hold information relevant for parsing, whereas the Document should hold information relevant for displaying pages. With this in mind, there is no reason for the Document to hold the xref table and trailer. These objects have been moved to the Parser, which allows the Parser to expose less public methods (which will be even more evident once linearized PDFs are supported).
2021-06-12LibPDF: Account for inverted y axis when rendering textMatthew Olsson
2021-06-12LibPDF: Harden the document/parser against errorsMatthew Olsson
2021-06-12LibPDF: Differentiate Value's null and empty statesMatthew Olsson
2021-06-06AK+Everywhere: Disallow constructing Functions from incompatible typesAli Mohammad Pur
Previously, AK::Function would accept _any_ callable type, and try to call it when called, first with the given set of arguments, then with zero arguments, and if all of those failed, it would simply not call the function and **return a value-constructed Out type**. This lead to many, many, many hard to debug situations when someone forgot a `const` in their lambda argument types, and many cases of people taking zero arguments in their lambdas to ignore them. This commit reworks the Function interface to not include any such surprising behaviour, if your function instance is not callable with the declared argument set of the Function, it can simply not be assigned to that Function instance, end of story.
2021-06-01Everywhere: codepoint => code pointAndreas Kling
2021-05-25LibPDF: Pre-initialize common FlyStrings in CommonNames.hMatthew Olsson
2021-05-25LibPDF: Handle string encodingsMatthew Olsson
Strings can be encoded in either UTF16-BE or UTF8. In either case, there are a few initial bytes which specify the encoding that must be checked and also removed from the final string.
2021-05-25LibPDF: Parse outline structuresMatthew Olsson
2021-05-25LibPDF: Store indirect value refs in Value objectsMatthew Olsson
IndirectValueRef is so simple that it can be stored directly in the Value class instead of being heap allocated. As the comment in Value says, however, in theory the max bits needed to store is 48 (16 for the generation index and 32(?) for the object index), but 32 should be good enough for now. We can increase it to u64 later if necessary.
2021-05-25LibPDF: Add basic color space support to the rendererMatthew Olsson
This commit only supports the three most basic color spaces: DeviceGray, DeviceRGB, and DeviceCMYK
2021-05-25LibPDF: Add a very poor path clipping implementationMatthew Olsson
This completely ignores the actual path and just uses its bounding box, since our painter doesn't support clipping to paths.
2021-05-25LibPDF: Implement stubs for all graphical commandsMatthew Olsson
2021-05-25LibPDF: Add support for stream filtersMatthew Olsson
This commit also splits up StreamObject into PlainTextStreamObject and EncodedStreamObject, which is essentially just a stream object which does not own its bytes vs one which does.
2021-05-25LibPDF: Make Reader::dump_state a bit more readableMatthew Olsson
2021-05-25LibPDF: Do not assume value is an object in parse_indirect_valueMatthew Olsson
2021-05-18LibPDF/PDFViewer: Support rotated pagesMatthew Olsson
2021-05-18Applications: Add a very simple PDFViewerMatthew Olsson
2021-05-18LibPDF: Fix bad resolve_to calls in DocumentMatthew Olsson
We can't call resolve_to with IndirectValue{,Ref}, since the values will obviously be resolved, and will not give us the object of the correct type.
2021-05-18LibPDF: Add a bitmap rendererMatthew Olsson
This commit adds the Renderer class, which is responsible for rendering a page into a Gfx::Bitmap. There are many improvements to make here, but this is a great start!
2021-05-18LibPDF: Parse page crop box and user unitsMatthew Olsson
2021-05-18LibPDF: Parse graphics commandsMatthew Olsson
2021-05-18LibPDF: Don't rely on a stream's /Length key existingMatthew Olsson
Some PDFs omit this key apparently, but Firefox opens them fine. Let's emulate that behavior.
2021-05-18LibPDF: Give Parser a reference to the DocumentMatthew Olsson
The Parser will need to call resolve_to on certain values.
2021-05-10LibPDF: Parse nested Page Tree structuresMatthew Olsson
We now follow nested page tree nodes to find all of the actual page dicts, whereas previously we just assumed the root level page tree node contained all of the page children directly.
2021-05-10LibPDF: Parse page structuresMatthew Olsson
This commit introduces the ability to parse the document catalog dict, as well as the page tree and individual pages. Pages obviously aren't fully parsed, as we won't care about most of the fields until we start actually rendering PDFs. One of the primary benefits of the PDF format is laziness. PDFs are not meant to be parsed all at once, and the same is true for pages. When a Document is constructed, it builds a map of page number to object index, but it does not fetch and parse any of the pages. A page is only parsed when a caller requests that particular page (and is cached going forwards). Additionally, this commit also adds an object_cast function which logs bad casts if DEBUG_PDF is set. Additionally, utility functions were added to ArrayObject and DictObject to get all types of objects from the collections to avoid having to manually cast.
2021-05-10LibPDF: Add a basic parser and Document structureMatthew Olsson
This commit adds a parser as well as the Reader class, which serves as a utility to aid in reading the PDF both forwards and in reverse. The parser currently is capable of reading xref tables, as well as all values. We don't really do anything with any of this information, however.
2021-05-10LibPDF: Create basic object structureMatthew Olsson
This commit is the start of LibPDF, and introduces some basic structure objects. This emulates LibJS's Value structure, where Value is a simple class that can contain a pointer to a more complex Object class with more data. All of the basic PDF objects have a representation.