0ct0pu5/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-12-03 13:00:29 +00:00

Author	SHA1	Message	Date
Nico Weber	4549d6cf1b	LibPDF: Add a FIXME comment to the inline image data skipping path	2023-10-26 10:59:45 +02:00
Nico Weber	2878af5968	LibPDF: Sketch out Lab color space Same as other recent color spaces: Enough to make us not assert, but not enough to actually produce color. Fixes 2 asserts on the `-n 500` 0000.zip pdfa dataset.	2023-10-26 10:59:45 +02:00
Tim Ledbetter	1a4df4ffe7	LibGfx/ICC: Avoid overflow when constructing `NamedColor2TagData`	2023-10-26 10:59:22 +02:00
Nico Weber	a65d8ff2ea	LibPDF: Tolerate page rotation being an indirect object Needed e.g. for 0000196.pdf in 0000.zip in the pdfa dataset.	2023-10-26 10:58:45 +02:00
Nico Weber	8b806183f6	LibPDF: Tolerate indirect objects in various image dict values 0000101.pdf from 0000.zip from the pdfa dataset has /Height set to an indirect object that contains an int. Make that work, and make sure various other similar places getting values of the image dict also resolve indirect references.	2023-10-26 10:58:45 +02:00
Nico Weber	5dd7639386	LibPDF: Tolerate indirect references in Type0 /W array Makes e.g. 0000236.pdf in 0000.zip in the pdfa dataset work.	2023-10-26 10:58:45 +02:00
Nico Weber	b928fadba7	LibPDF: Swap int and array branches in outline item reading No intended behavior change. It does have the effect that indirect object references now go down the array path instead of the number path. They still fall over there, but now that's easy to fix.	2023-10-26 10:58:45 +02:00
Nico Weber	208a058eab	LibPDF: Tolerate integer outline item colors 0000296.pdf from 0000.zip from the pdfa dataset contains `/C [0 0 0]` (as opposed to `/C [0.0 0.0 0.0]`). Make that work. (It's fine per spec.)	2023-10-26 10:58:45 +02:00
Tim Ledbetter	7096ea82f9	LibGfx: Use `count_leading_zeroes` to calculate nearest power of 2 This removes the possibility of an infinite loop.	2023-10-26 08:39:26 +02:00
Tim Ledbetter	6e4c97a328	LibGfx/WOFF: Return error if `numTables` is 0 This is consistent with WOFF2.	2023-10-26 08:39:26 +02:00
Tim Ledbetter	52f78d07b8	LibGfx/WOFF2: Ensure `numTables` is within expected range An error is now returned if `numTables` is zero or greater than 4096. While this isn't explicitly mentioned in the specification, subsequent calculations will be incorrect if the value falls outside this range.	2023-10-26 08:39:26 +02:00
Aliaksandr Kalenik	437442719d	LibWeb: Fix coordinate translation for PaintTextShadow command Fixes regression introduced in `4318bcf447` `shadow_bounding_rect` is used on bitmap allocated for shadow and is not supposed to be in coordinate system of stacking context. Same for `text_rect`. Fixes https://github.com/SerenityOS/serenity/issues/21587	2023-10-26 08:38:16 +02:00
Aliaksandr Kalenik	58f8068853	LibWeb: Handle fit-content in calculate_max_content_contribution in GFC Fixes https://github.com/SerenityOS/serenity/issues/21569	2023-10-26 08:37:42 +02:00
Aliaksandr Kalenik	04a4ffaa3d	LibWeb/Painting: Remove Save and Restore commands in RecordingPainter Removing State and Restore reduces painting commands count by 30-50% on an average website. Due to this change, it was also necessary to replace AddClipRect with SetClipRect command.	2023-10-26 07:51:32 +02:00
networkException	0388766531	LibWeb: Set module map entry before invoking callbacks This patch fixes the value of a module map entry being wrong in the callbacks invoked in the set call. Previously we would set the value in only after invoking the callbacks, leading to crashes when a callback implementation would rightfully assume the value to be set already. Resolves #20994	2023-10-25 21:29:21 +02:00
Bastiaan van der Plaat	169d24ae2e	LibWeb: Add comments and missing items of various IDL files	2023-10-25 19:45:41 +02:00
Bastiaan van der Plaat	fc46def2f5	LibWeb: Reorder and add missing HTML elements IDL items	2023-10-25 19:45:41 +02:00
Nico Weber	54cdcd0d06	LibPDF: Reject non-hexdigits in hex string with error ...instead of VERIFY()ing input data. I haven't seen this in the wild, but since I'm here anyways, might as well fix this.	2023-10-25 10:44:26 +02:00
Nico Weber	4675700057	LibPDF: Reject unterminated literal strings with an error 0000459.pdf in 0000.zip in the pdfa dataset contains this as the very first object: ``` 1 0 obj << /Creator (Developer 2000) /CreatorDate ( /Author (Oracle Reports) /Producer (Oracle PDF driver) /Title (2021_06_29 Tutoritzacions APTES.PDF) >> endobj ``` The `/CreatorDate` value string is unterminated. Before, we'd assert when trying to check if the first object is a linearization dict. Now, we never read the first object (an error during the linearization dict reading is treated as "file is not linearized") unless we try to print the document's metadata -- and there we now show an error instead of asserting.	2023-10-25 10:44:26 +02:00
Nico Weber	c0f3f1674c	LibPDF: Make string literal parsing fallible ...and make running out of data after a \ an error instead of silently returning an empty string.	2023-10-25 10:44:26 +02:00
Aliaksandr Kalenik	4318bcf447	LibWeb: Record painting commands in coordinates of stacking context By storing painting command coordinates relative to the nearest stacking context we can get rid of the Translate command. Additionally, this allows us to easily check if the bounding rectangles of the commands cover or intersect within a stacking context. This should be useful if we decide to optimize by avoiding the execution of commands that will be overpainted by the results of subsequent commands.	2023-10-25 05:53:36 +02:00
Nico Weber	311cc7d9b9	LibPDF: Implement two SeparationColorSpace methods Actually using separation color spaces still doesn't work, but we now no longer assert on them when they're used. Fixes 2 crashes on the `-n 500` 0000.zip pdfa dataset.	2023-10-25 05:52:47 +02:00
Tim Ledbetter	2311e28d63	LibGfx/BMPLoader: Mitigate potential overflows when decoding bitmap DIB	2023-10-25 05:52:29 +02:00
Tim Ledbetter	8ec26f3b54	LibGfx/BMPLoader: Account for header size when checking DIB bounds	2023-10-25 05:52:29 +02:00
Timothy Flynn	4e0a926737	LibWeb: Protect ad-hoc `scroll` against a potentially null paintable box We perform such a check in other users of the paintable box in this file as the box may be null before layout completes. This prevents UB seen in some CI runs.	2023-10-25 05:49:37 +02:00
tetektoza	d2995f7517	LibGUI: Add missing constructor to UISize class for fixed_size property This patch adds a missing constructor to UISize class for fixed_size property, so the property can take an array if user specified it in .gml file.	2023-10-24 21:47:18 +02:00
Nico Weber	e7f7c434f7	LibPDF: Don't check for `startxref` after trailer dict Several files have a comment after the trailer dict and the `startxref` after it. We really should add a consume_whitespace_and_comments() function and call that in most places we currently call consume_whitespace(). But in this case, for non-linearized files, we first jump to the end of the file, read `startxref`, then jump to `xref` from the offset there, and then read the trailer after the `xref`, only to read `startxref` again. So we can just not do that. (For linearized files, we now completely ignore `startxref`. But we don't use the data in `startxref` in linearized files anyways, so it's fine to not read it there too.) Reduces number of crashes on 300 random PDFs from the web (the first 300 from 0000.zip from https://pdfa.org/new-large-scale-pdf-corpus-now-publicly-available/) from 25 (8%) to 23 (7%).	2023-10-24 13:32:01 -04:00
Nico Weber	acf668e234	LibPDF: Make Reader::move_by() parameter more truthful No behavior change, just simpler and less surprising.	2023-10-24 13:30:25 -04:00
Aliaksandr Kalenik	b13ff8def6	LibWeb: Separate "out of view" check from RecordingPainter commands	2023-10-24 18:55:12 +02:00
Aliaksandr Kalenik	94f322867a	LibWeb: Get rid of DevicePixels usage in RecordingPainter This change removes inconsistency in coordinate representation of painter commands by changing everything to use int.	2023-10-24 18:55:12 +02:00
Tim Ledbetter	e9be1bcd09	LibGfx/WOFF2: Reject fonts with a compressed size larger than 10MiB This prevents a potential OOM condition when the header is malformed.	2023-10-24 13:45:01 +02:00
Tim Ledbetter	af633523af	LibGfx/WOFF2: Tolerate incorrect `totalSfntSize` in WOFF2 header The specification says that this value is for reference only, so we should be able to load a file where this value is incorrect.	2023-10-24 13:45:01 +02:00
Tim Ledbetter	c2112cde76	LibGfx/WOFF: Ensure header `totalSfntSize` matches expected value	2023-10-24 07:29:09 +02:00
Tim Ledbetter	7ee09ca49d	LibGfx/WOFF: Avoid overflow in table directory search range This commit limits `WOFF::Header::num_tables` to 4096. This limitation is not explicitly mentioned in the specification, but allowing numbers larger than this results in an overflow when calculating `search_range` and `range_shift`.	2023-10-24 07:29:09 +02:00
Timothy Flynn	6af279a22d	LibWebView: Add a helper to get selected text with collapsed whitespace	2023-10-24 07:28:30 +02:00
Timothy Flynn	e221b3afeb	LibWebView: Add an API to format a search query for UI display This will create a string of the form: Search DuckDuckGo for "Ladybird is awesome!" If the provided query URL is unknown, the engine name is excluded (e.g. for custom search URLs).	2023-10-24 07:28:30 +02:00
Timothy Flynn	c8c3d00615	LibWebView: Rename find_search_engine to find_search_engine_by_name We will also need to search by URL in the Serenity chrome.	2023-10-24 07:28:30 +02:00
Bastiaan van der Plaat	d5ca8209bf	LibWeb: Add input element valueAsNumber	2023-10-24 07:27:14 +02:00
Bastiaan van der Plaat	d290569535	LibWeb: Don't create shadow root for input hidden	2023-10-24 07:27:14 +02:00
Bastiaan van der Plaat	3e778fe29c	LibWeb: Reorder and add missing form elements IDL items	2023-10-24 07:27:14 +02:00
Aliaksandr Kalenik	4dab17427f	LibWeb: Use max content contribution in flex_fraction in GFC As spec comment in the code says we should use item’s max-content contribution to calculate flex fraction. Likely, it was calculate_max_content_size() because we didn't have calculate_max_content_contribution() when this function was implemented initially.	2023-10-24 07:26:25 +02:00
Aliaksandr Kalenik	802b58d7e1	LibWeb: Resolve grid item's min-width and max-width in GFC Now min-width and max-width properties affect resulting width of grid item instead of being ignored.	2023-10-24 07:25:20 +02:00
Adrian Mück	7e10f76021	LibGUI: Don't update ComboBox text when model index is invalid Without ENABLE_TIME_ZONE_DATA the user is able to update the time zone selection when using "on_mousewheel" or "on_{up,down}_pressed" even though UTC is supposed to be the only option. Due to an invalid model index, the selection is set to "[null]". Prior to this patch, the ComboBox text in "selection_updated" is set at each call.	2023-10-23 21:18:36 -04:00
Nico Weber	3fe9f8e48d	LibPDF: Don't accidentally form new tokens on pages with contents arrays A page's /Contents can be an array of streams, and the page's contents are then as if those streams are concatenated. Most of the time, a stream ends with whitespace. But in some cases (e.g. 0000642.pdf from 0000.zip from the pdfa dataset), the first stream ends with an operator (`Q`) and the next stream starts with one (`q`), and the concatenation would form a new, unkonwn operator (`Qq`). Separate the streams' contents with a space to prevent that. Reduces numbers of PDF files we fail to open in the -n 500 case from 11 to 10 (in either case, we then crash on 18 of the PDFs that we do manage to open).	2023-10-23 13:23:54 -04:00
Timothy Flynn	b770ed03ac	LibWebView: Define the list of built-in search engines in LibWebView These engines and their query URLs are duplicated in several places. Before implementing search support in the AppKit chrome, let's move these engines to LibWebView.	2023-10-23 12:12:36 -04:00
Nico Weber	11bee7a075	LibPDF: Don't crash on fixed-width type 1 fonts that use /MissingWidth Type 1 fonts usually have a m_font_program and no m_font -- they only have m_font if we're using a replacement font for the fonts that were built-in to PDFs before Acrobat 4.0 (and must still work to show existing files). However, SimpleFont::get_glyph_width() used to always return a float, which in Type1Font was only implemented if m_font was set. Per spec, we're supposed to just use /MissingWidth for fonts that are missing an entry in the descriptor's /Width array. However, for built-in fonts, no explicit /Width array is needed (PDF 1.7 spec, Appendix H.3, 5.5.1). So if we just always use /MissingWidth, then PDFs that use a built-in font draw all their text on top of each other (e.g. 000333.pdf from stillhq.com-pdfdb). So change get_glyph_width() to return Optional<float>, return it only in Type1Font if m_font is set, and use MissingWidth if it isn't set. That way, replacement fonts still return a width, and real fonts that are supposed to have /Width and use /MissingWidth for missing entries do what they're supposed to too, instead of crashing. From 20 (6%) to 16 (5%) crashes on the 300 first PDFs, and from 39 (7.8%) to 31 (6.2%) on the 500-random PDFs test.	2023-10-23 09:33:03 -04:00
Nico Weber	52afa936c4	LibPDF: Don't over-read in charset formats 1 and 2 `left` might be a number bigger than there are actually glyphs in the CFF. The spec says "The number of ranges is not explicitly specified in the font. Instead, software utilizing this data simply processes ranges until all glyphs in the font are covered." Apparently we have to check for this within each range as well. Needed for example in 0000054.pdf and 0000354.pdf in 0000.zip in the pdfa dataset. Together with the previous commit: From 21 (7%) to 20 (6%) crashes on the 300 first PDFs, and from 41 (8.2%) to 39 (7.8%) on the 500-random PDFs test.	2023-10-23 09:31:11 -04:00
Nico Weber	58ff7b5336	LibPDF: Support offset size 3 in CFF index reading ...and replace template instantiations with a loop, to make this easily possible. Vaguely nice for code size as well. Needed for example in 0000054.pdf and 0000354.pdf in 0000.zip in the pdfa dataset.	2023-10-23 09:31:11 -04:00
Nico Weber	3197f0cab6	LibPDF: Handle CFF fonts with charset format 0 and > 255 glyphs better We used to use an u8 as loop counter, which would overflow if there were more than 255 glyphs, producing hundreds of megabytes of Couldn't find string for SID x, going with space output in the process, while all data until the end of the CFF section got interpreted as SIDs, until a try_read() would finally fail. We now no longer fail miserably trying to render page 2 of 0000352.pdf of 0000.zip from the pdfa dataset. Fixes just one crash of the larger 500-document test set, but when I tweak test_pdf.py to print all stacks instead of just the top 5, it no longer produces 260 MB of output.	2023-10-23 09:31:11 -04:00
Nico Weber	0869ca5615	LibPDF: Add more CFF_DEBUG output	2023-10-23 09:31:11 -04:00

1 2 3 4 5 ...

20731 commits