0ct0pu5/ladybird

Author	SHA1	Message	Date
Nico Weber	c99506da7d	LibGfx/JBIG2: Initialize POD members And use Array<> instead of C-style arrays.	2024-03-23 17:30:15 -04:00
Nico Weber	924423c596	LibGfx/JBIG2: Make context index a u8 This value is at most 46, so a u8 is enough. We have tens of thousands of these contexts. (We could pack the is_mps bit into that u8 as well, but then the I() and MPS() functions need to return helper objects instead of a direct reference, so let's not do that part for now.)	2024-03-20 09:09:54 +01:00
Nico Weber	b8f80501ec	LibGfx/JBIG2: Pass Context to get_next_bit() instead of to initialize() The context can vary for every bit we read. This does not affect the one use in the test which reuses the same context for all bits, but it is necessary for future changes.	2024-03-16 09:21:42 -04:00
Nico Weber	df9dd8ec69	LibGfx/JBIG2: Add arithmetic coding decoder I think the context normally changes for every bit. But this here is enough to correctly decode the test bitstream in Annex H.2 in the spec, which seems like a good checkpoint. The internals of the decoder use spec naming, to make the code look virtually identical to what's in the spec. (Even so, I managed to put in several typos that took a while to track down.)	2024-03-14 18:18:15 -06:00
Nico Weber	1eaaa8c3e9	LibPDF+LibGfx: Support JBIG2s with /JBIG2Globals set Several ramifications: * /JBIG2Globals is an indirect reference, which means we now need a Document for unfiltering. (Technically, other decode parameters can also be indirect objects and we should use the Document to resolve() those too, but in practice it only seems to be needed for /JBIG2Globals.) * Since /JBIG2Globals are so rare, we just parse once for each image that use them, and decode_embedded() now receives a Vector<ReadonlyBytes> with all sections of sequences of segments. * Internally, decode_segment_headers() is now called several times for embedded JBIG2s with multiple such sections (e.g. PDFs with /JBIG2Globals). * That means `data` is now no longer part of JBIG2LoadingContext and things get slightly reshuffled due to this. This completes the LibPDF part of JBIG2 support. Once LibGfx implements actual decoding of JBIG2s, things should start to Just Work in PDFs.	2024-03-09 16:01:22 +01:00
Nico Weber	953f6c5d9b	LibPDF+LibGfx: Pass jbig2-filtered data to JBIG2ImageDecoderPlugin Except for /JBIG2Globals, which we bail out on for now. In my 1000 files, 13 use JBIG2, and of those, 2 use JBIG2Globals (0000372.pdf e.g. page 11 and 0000857.pdf e.g. page 1), and only one (the latter) of the two uses the same JBIG2Globals stream for more than a single image. JBIG2ImageDecoderPlugin cannot decode the data yet, so no behavior change, but with `#define JBIG2_DEBUG 1` at the top of that file, it now prints segment header info for PDFs containing JBIG2 data :^)	2024-03-09 16:01:22 +01:00
Nico Weber	5cefcad2fe	LibGfx/JBIG2: Decode the file header Running `image` with `#define JBIG2_DEBUG 1` now prints number of pages.	2024-03-09 16:01:22 +01:00
Nico Weber	58838db445	LibGfx: Add the start of a JBIG2 loader JBIG2 is infamous for two things: 1. It's used in xerox scanners were it falsifies scanned numbers: https://www.dkriesel.com/en/blog/2013/0802_xerox-workcentres_are_switching_written_numbers_when_scanning 2. It was allegedly used in an iOS zero day, in a very cool way: https://googleprojectzero.blogspot.com/2021/12/a-deep-dive-into-nso-zero-click.html Needless to say, we need support for it in Serenity. (...because it's used in PDF files.) This adds all the scaffolding, but no actual implementation yet. It's enough for `file` to print the mime type of .jb2 files, but `image` can't do anything with the files yet.	2024-03-09 16:01:22 +01:00

8 commits