0ct0pu5/ladybird

Author	SHA1	Message	Date
Nico Weber	1e95c08db5	LibGfx/ISOBMFF: Add JPEG2000ChannelDefinitionBox	2024-03-25 20:35:00 +01:00
Nico Weber	f080836127	LibGfx/ISOBMFF: Add JPEG2000URLBox	2024-03-25 20:35:00 +01:00
Nico Weber	c58996f4fc	LibGfx/ISOBMFF: Add JPEG2000ContiguousCodestreamBox	2024-03-25 20:35:00 +01:00
Nico Weber	f372a9b346	LibGfx/ISOBMFF: Add JPEG2000UUIDListBox	2024-03-25 20:35:00 +01:00
Nico Weber	4a95e55fb3	LibGfx/ISOBMFF: Add JPEG2000CaptureResolutionBox	2024-03-25 20:35:00 +01:00
Nico Weber	b386d5bb14	LibGfx/ISOBMFF: Add JPEG2000ResolutionBox	2024-03-25 20:35:00 +01:00
Nico Weber	7d137dc480	LibGfx/ISOBMFF: Add JPEG2000UUIDInfoBox	2024-03-25 20:35:00 +01:00
Nico Weber	214ff799ce	LibGfx/ISOBMFF: Add JPEG2000ColorSpecificationBox	2024-03-25 20:35:00 +01:00
Nico Weber	59bd378db8	LibGfx/ISOBMFF: Add JPEG2000ImageHeaderBox	2024-03-25 20:35:00 +01:00
Nico Weber	78deac3dca	LibGfx/ISOBMFF: Give Reader::read_entire_file() a factory callback This will allow creating different child boxes in different containers.	2024-03-25 20:35:00 +01:00
Nico Weber	b7a120c47e	LibGfx/ISOBMFF: Remove Box::read_from_stream() This doesn't have to be a virtual method: it's called from various create_from_stream() methods that have a static type that's created. There's no point in the virtual call here, and it makes it harder to add additional parameters to read_from_stream() in some subclasses.	2024-03-25 20:35:00 +01:00
Nico Weber	c84487ed2d	LibGfx/ISOBMFF: Give JPEG2000HeaderBox its own type ...and make SuperBox a pure superclass that's not usable by itself.	2024-03-25 20:35:00 +01:00
Nico Weber	65bd090815	LibGfx/ISOBMFF: Start creating JPEG2000 box types `isobmff` can now dump the id in a JPEG2000SignatureBox. Creates JPEG2000Boxes.{h,cpp} to house JPEG2000 box types.	2024-03-25 20:35:00 +01:00
Nico Weber	a073b2d047	LibGfx/ISOBMFF: Read JPEG2000HeaderBox	2024-03-25 20:35:00 +01:00
Nico Weber	15ba0a7e18	LibGfx/ISOBMFF: Make BoxStream MaybeOwn its stream ...and make Reader always have a BoxStream.	2024-03-25 20:35:00 +01:00
Nico Weber	a72770cdf6	LibGfx/ISOBMFF: Add JPEG2000 box types I prefixed the types that are labeled as "JPEG2000" on https://mp4ra.org/registered-types/boxes with "JPEG2000".	2024-03-25 20:35:00 +01:00
Nico Weber	cdbdc334de	LibGfx/ISOBMFF: Alphabetize box type ENUMERATE_ONE() lines	2024-03-25 20:35:00 +01:00
Nico Weber	e81009b338	LibGfx/ISOBMFF: Put string literals in box type ENUMERATE_ONE() This allows types that have spaces in their FourCC.	2024-03-25 20:35:00 +01:00
Nico Weber	bdb4f6bd49	LibGfx/ISOBMFF: Remove prototypes for nonexistent methods	2024-03-25 20:35:00 +01:00
Nico Weber	270d3303ce	LibGfx/ISOBMFF: FileTypeBox is not a FullBox	2024-03-25 20:35:00 +01:00
Nico Weber	7dd5457b8f	LibGfx/JBIG2: Add support for refinement coding template 1 This is used when refining a symbol in 0000337.pdf.	2024-03-25 13:16:02 -04:00
Nico Weber	ef9bfce0e7	LibGfx/JBIG2: Add support for SDREFAGG=1 symbol segments ...but only as long as REFAGGNINST == 1. That's enough for 0000337.pdf. Except that it also needs GRTEMPLATE=1 support in the generic refinement region decoding procedure, so no behavior change yet.	2024-03-25 13:16:02 -04:00
Nico Weber	3fa2ecdd65	LibGfx/JBIG2: Extract read_id() into a class We'll need this for refinement/aggregate coding of symbols.	2024-03-25 13:16:02 -04:00
Nico Weber	68d47cb84a	LibGfx/JBIG2: Implement support for symbols segments with input symbols Needed for 0000337.pdf. It now fails complaining about missing SDREFAGG support.	2024-03-25 13:16:02 -04:00
Nico Weber	59e6a10f30	LibGfx/JBIG2: Initialize POD members of refinement region input struct I missed putting this in #23696 while juggling local branches. No behavior change.	2024-03-25 12:07:18 -04:00
Nico Weber	8e9157d6ce	LibGfx/JBIG2: Implement decode_end_of_stripe() a bit This is enough to be able to decode 0000857.pdf p1-4 and 0000372.pdf p11.	2024-03-25 14:08:40 +01:00
Nico Weber	c4a45bb521	LibGfx/JBIG2: Make compute_context() a function pointer ...instead of a lambda that checks the template on every call. Doesn't make a performance difference locally, but seems maybe nicer? No behavior change.	2024-03-25 14:08:40 +01:00
Nico Weber	828c640087	LibGfx/JBIG2: Make get_pixel static constexpr ...so it doesn't need to be captured.	2024-03-25 14:08:40 +01:00
Nico Weber	b45a4508c7	LibGfx/JBIG2: Implement support for context templates 1, 2, and 3 Template 2 is needed by some symbols in 0000372.pdf page 11 and 0000857.pdf pages 1-4. Implement the others too while here. (The mentioned pages in those two PDFs also use the "end of stripe" segment, so they still don't render yet.) We still don't support EXTTEMPLATE.	2024-03-25 14:08:40 +01:00
Nico Weber	7035c2a2ff	LibGfx/JBIG2: Add some debug logging to decode_page_information()	2024-03-25 14:08:40 +01:00
Nico Weber	d2998c1f5e	LibGfx/JBIG2: Implement generic_refinement_region_decoding_procedure() With this, we can decode all pages of 0000425.pdf, 0000215.pdf, 0000882.pdf, and 0000057.pdf.	2024-03-25 08:15:36 +01:00
Nico Weber	0d2e91b4ea	LibGfx/JBIG2: Reject things in refinement decoding These aren't hit for my 1000 page PDF test set.	2024-03-25 08:15:36 +01:00
Nico Weber	562d8ed619	LibGfx/JBIG2: Stub out generic_refinement_region_decoding_procedure() ...and make text_region_decoding_procedure() call it. generic_refinement_region_decoding_procedure() still just returns "unimplemented", so no behavior change yet.	2024-03-25 08:15:36 +01:00
Nico Weber	c4c48c1d5f	LibGfx/JBIG2: Sketch out text segment refinement coding a bit	2024-03-25 08:15:36 +01:00
Nico Weber	9f327833c0	LibGfx/JBIG2: Read refinement adaptive template pixels for text segments Text segments using refinement are still rejected later, by text_region_decoding_procedure(). But we deserialize the input data now, and the error when this feature is used is now slightly different.	2024-03-25 08:15:36 +01:00
Nico Weber	ced21d8419	LibGfx/JBIG2: Call decode_immediate_text_region for lossless text region It seems to do the right thing already, and nothing in the spec says not to do this as far as I can tell. With this, we can finally decode the test input from #23659. See `f391c7822d` for a similar change for generic regions and lossless generic regions.	2024-03-23 17:30:15 -04:00
Nico Weber	b15e1d2b2a	LibGfx/JBIG2: Implement initial support for text segments Text segments conceptually store (x,y,id) triples. (x,y) are a coordinate, and id refers to an id from a symbol segment. A text segment has the effect of drawing some of the bitmaps stored in a symbol segment to the output bitmap. For example, the symbol segment might contain a small bitmap that happens to look like the letter 'A', and the text segment might draw that everywhere a scanned page has an 'A'. (The JBIG2 format only treats it as an abstract bitmap. It doesn't know that this small bitmap is an 'A'.) This is missing support for many things: * Huffman-coded input (not used in practice) * Symbol refinement * Transposed symbols * Colors (not used in practice) Still, we now have basic symbol/text segment support. This is enough to decode the downloadable PDF here: https://www.google.com/books/edition/Paradise_Lost/6qdbAAAAQAAJ It doesn't lead to any progression on my 1000 file test PDF set. The 7 files in there that use JBIG2 with symbol and text segments now fail to load for other reasons (4 need symbol refinement for text segments, one needs end-of-stripe segment support, one needs support for symbol segments referring to other segments). (And possibly, many other PDFs from Google Books, but that's the only one I've tried so far.)	2024-03-23 17:30:15 -04:00
Nico Weber	3454970903	LibGfx/JBIG2: Extract composite_bitbuffer() and add some features This extracts the bitbuffer combining code we had into a new function composite_bitbuffer() and adds the following features: * Real support for combination operators (which also lets us allow black as background color again, even if that's never used in practice) * Clipping support (not used here yet, but will be needed elsewhere soon) We're going to need this for text segment handling. No behavior change.	2024-03-23 17:30:15 -04:00
Nico Weber	754e1b46fc	LibGfx/JBIG2: Implement basic symbol segment processing A symbol segment defines a bunch of small bitmaps and associates them with numeric IDs. This only implements reading symbols encoded with the arithmetic coder. It does not support Huffman coding. (In practice, everything seems to use arithmetic coding.) Support for refinement or aggregate coding isn't implemented yet. Support for retaining bitmap coding contexts isn't implemented yet. Support for symbol segments referring to other symbol segments isn't implemented yet. But all produce diagnostics if encountered, so we won't forget about them. (I haven't seen either being used in the wild.) No visible behavior change yet, but with JBIG2_DEBUG turned on, it produces all kinds of debug output.	2024-03-23 17:30:15 -04:00
Nico Weber	93fcb529cf	LibGfx/JBIG2: Move SegmentData down a bit Symbol segments will store decoded symbols, and for that SegmentData needs to come after BitBuffer. No behavior change.	2024-03-23 17:30:15 -04:00
Nico Weber	2099ca48a1	LibGfx/JBIG2: Pass in decoder and contexts to generic region decoder The symbol segment decoding procedure will read generic regions that aren't at a byte boundary, and that share contexts across several regions. No behavior change.	2024-03-23 17:30:15 -04:00
Nico Weber	376b1a2309	LibGfx/JBIG2: Have just one CombinationOperator enum class We already had two, and we would need another one for text segments. No behavior change.	2024-03-23 17:30:15 -04:00
Nico Weber	c06110da87	LibGfx/JBIG2: Make AdaptiveTemplatePixel toplevel We're going to need it for symbol segment decoding too. No behavior change.	2024-03-23 17:30:15 -04:00
Nico Weber	8e82c2b932	LibGfx/JBIG2: Add arithmetic integer decoder The existing ArithmeticEncoder (from Annex E) reads one bit at a time. ArithmeticIntegerDecoder (from Annex A) builds on top of that to read integer values. This will be used by both the symbol segment and the text segment readers. (This does not yet implement the IAID decoding procedure in A.3. We only need that one in the text segment decoder at the moment, and it's pretty small, so I'll put it inline there for now.) Not used yet, so no behavior change yet.	2024-03-23 17:30:15 -04:00
Nico Weber	c99506da7d	LibGfx/JBIG2: Initialize POD members And use Array<> instead of C-style arrays.	2024-03-23 17:30:15 -04:00
Nico Weber	730876fda9	LibGfx/JPEG: Add a comment to inverse_dct_8x8() See here: https://github.com/SerenityOS/serenity/issues/22739#issuecomment-1890599116 No behavior change.	2024-03-23 09:40:29 +01:00
Nico Weber	9bf29356a2	LibGfx/ISOBMFF: Support box header size 0 to mean "until end of data" JPEG2000 uses this, and as far as I can tell it's also part of ISO/IEC 14496-12.	2024-03-22 18:31:23 +01:00
Nico Weber	0d098211b7	LibRIFF+LibGfx/ISOBMFF: Make ChunkID (de)serialization self-consistent Previously, ChunkID's from_big_endian_number() and as_big_endian_number() weren't inverses of each other. ChunkID::from_big_endian_number() used to take a u32 that contained `('f' << 24) | ('t' << 16) | ('y' << 8) | 'p'`, that is 'f', 't', 'y', 'p' in memory on big-endian and 'p', 'y', 't', 'f' on little-endian, and return a ChunkID for 'f', 't', 'y', 'p'. ChunkID::as_big_endian_number() used to return a u32 that for a ChunkID storing 'f', 't', 'y', 'p' was always 'f', 't', 'y', 'p' in memory on both little-endian and big-endian, that is it stored `('f' << 24) | ('t' << 16) | ('y' << 8) | 'p'` on big-endian and `('p' << 24) | ('y' << 16) | ('t' << 8) | 'f'` on little-endian. `ChunkID::from_big_endian_number(0x11223344).as_big_endian_number()` returned 0x44332211. This change makes the two methods self-consistent: they now take and return a u32 that always has the first ChunkID part in the highest bits of the u32 (`'f' << 24`), and so on. That also means they return a u32 that looks different in memory on big-endian and little-endian. Since that's normal for numbers, this also renames the two methods to just `from_number()` and `to_number()`. With the semantics cleared up, change the one use in ISOBMFF to read a BigEndian for chunk headers and brand codes. This has the effect of tags now being printed in the right order. 
Before: ```sh % Build/lagom/bin/isobmff ~/Downloads/sample1.jp2 Unknown Box (' Pj') [ 4 bytes ] ('pytf') (version = 0, flags = 0x0) - major_brand = ' 2pj' - minor_version = 0 - compatible_brands = { ' 2pj' } Unknown Box ('h2pj') [ 37 bytes ] Unknown Box ('fniu') [ 92 bytes ] Unknown Box (' lmx') [ 2736 bytes ] Unknown Box ('c2pj') [ 667336 bytes ] ``` After: ```sh % Build/lagom/bin/isobmff ~/Downloads/sample1.jp2 hmm 0x11223344 0x11223344 Unknown Box ('jP ') [ 4 bytes ] ('ftyp' ) (version = 0, flags = 0x0) - major_brand = 'jp2 ' - minor_version = 0 - compatible_brands = { 'jp2 ' } Unknown Box ('jp2h') [ 37 bytes ] Unknown Box ('uinf') [ 92 bytes ] Unknown Box ('xml ') [ 2736 bytes ] Unknown Box ('jp2c') [ 667336 bytes ] ```	2024-03-22 18:31:15 +01:00
Nico Weber	b43092db46	LibGfx/ISOBMFF: Print only one set of quotes around FourCCs AK::Formatter<RIFF::ChunkID> (in LibRIFF/ChunkID.h) adds them already, so don't add them here too.	2024-03-22 18:31:15 +01:00
Nico Weber	924423c596	LibGfx/JBIG2: Make context index a u8 This value is at most 46, so a u8 is enough. We have tens of thousands of these contexts. (We could pack the is_mps bit into that u8 as well, but then the I() and MPS() functions need to return helper objects instead of a direct reference, so let's not do that part for now.)	2024-03-20 09:09:54 +01:00
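
The round-trip property that commit 0d098211b7 describes for `from_number()` and `to_number()` can be sketched roughly as follows. This is a minimal illustration, assuming only what the commit message states: the first FourCC character sits in the highest bits of the u32, regardless of host byte order. `FourCC`, `byte_at()`, and `check_round_trip()` are hypothetical names made up for this sketch; it is not the actual `RIFF::ChunkID` code.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a four-character code; NOT the real RIFF::ChunkID.
struct FourCC {
    std::array<char, 4> chars;

    // The first character is taken from the highest byte of the number.
    static constexpr FourCC from_number(std::uint32_t n)
    {
        return FourCC { { static_cast<char>((n >> 24) & 0xff),
            static_cast<char>((n >> 16) & 0xff),
            static_cast<char>((n >> 8) & 0xff),
            static_cast<char>(n & 0xff) } };
    }

    // Inverse of from_number(): the first character goes back into the highest byte.
    constexpr std::uint32_t to_number() const
    {
        return (byte_at(0) << 24) | (byte_at(1) << 16) | (byte_at(2) << 8) | byte_at(3);
    }

    // Widens one character to an unsigned value so the shifts cannot sign-extend.
    constexpr std::uint32_t byte_at(int i) const
    {
        return static_cast<std::uint32_t>(static_cast<unsigned char>(chars[i]));
    }
};

// Compile-time checks: the two functions are inverses of each other, and
// 0x66747970 ('f' << 24 | 't' << 16 | 'y' << 8 | 'p') decodes in reading order.
static_assert(FourCC::from_number(0x11223344).to_number() == 0x11223344, "round trip");
constexpr FourCC ftyp = FourCC::from_number(0x66747970);
static_assert(ftyp.chars[0] == 'f' && ftyp.chars[1] == 't', "first two characters");
static_assert(ftyp.chars[2] == 'y' && ftyp.chars[3] == 'p', "last two characters");

// Hypothetical runtime helper performing the same round-trip check.
inline void check_round_trip(std::uint32_t n)
{
    assert(FourCC::from_number(n).to_number() == n);
}
```

Because the sketch only shifts and masks, it never inspects the in-memory byte layout of the u32, which is why the same result holds on little-endian and big-endian hosts.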

582 commits