0ct0pu5/ladybird

Author	SHA1	Message	Date
Matthew Olsson	ff00d21d58	Everywhere: Mark a bunch of function parameters as NOESCAPE This fixes the relevant warnings when running LibJSGCVerifier. Note that the analysis is only performed over LibJS-adjacent code, but could be performed over the entire codebase. That will have to wait for a future commit.	2024-04-09 09:10:44 +02:00
Shannon Booth	96fc1741b5	LibWeb: Return an Optional<String> from HTMLToken::attribute Move away from using a nullable StringView.	2023-11-11 08:50:25 +01:00
Shannon Booth	1f8d72da8e	LibWeb: Port HTMLToken::to_deprecated_string to new AK String	2023-11-06 11:37:08 +01:00
Shannon Booth	8fbf72b5bf	LibWeb: Port HTMLToken prefix and namespace to Optional<FlyString> Previously these were DeprecatedStrings that contained a null state. After the null state was removed, the nullability of these members was broken. This doesn't seem to cause any problems currently as the HTML parser is not inserting attributes with their full qualified name, but after we fix that problem, this bug surfaces.	2023-11-05 11:16:16 +00:00
Andreas Kling	3ff81dcb65	LibWeb: Make Web::Namespace::Foo strings be FlyString This required dealing with a lot of fallout, but it's all basically just switching from DeprecatedFlyString to either FlyString or Optional<FlyString> in a hundred places to accommodate the change.	2023-11-04 21:28:30 +01:00
Andreas Kling	b341aeb5c1	LibWeb: Switch HTMLToken and HTMLTokenizer to String & FlyString	2023-11-04 21:28:30 +01:00
Shannon Booth	79ed72adb4	LibWeb: Port HTMLToken::make_start_tag from DeprecatedFlyString	2023-10-08 08:11:48 -04:00
Shannon Booth	d8635fe541	LibWeb: Port HTMLParser local name and value from DeprecatedString	2023-10-08 08:11:48 -04:00
Shannon Booth	e4f8c59210	LibWeb: Port AttributeNames to FlyString	2023-10-08 08:11:48 -04:00
Shannon Booth	9303e9e76f	LibWeb: Port Element::local_name and TagNames from Deprecated String Which pretty much needs to be done together due to the amount of places where they are compared together. This also involves porting over StackOfOpenElements over to FlyString from DeprecatedFly string to prevent a gazillion calls to `.to_deprecated_fly_string` calls in HTMLParser.	2023-10-03 14:47:53 +01:00
Timothy Flynn	fea440055a	LibWeb: Track the byte offset of an HTMLToken's position We currently track the [line, column] position of every HTMLToken, as this is what is needed for LibGUI's syntax highlighting. Some non-LibGUI purposes (e.g. highlighting HTML with HTML) require a byte offset. Track both during tokenization.	2023-08-29 08:11:11 -04:00
Timothy Flynn	5a2bf7fdd1	LibWeb: Set the correct end position of HTML attribute names We were previously setting the end position of attribute names in self- closing HTML tags to the end of the attribute value. To illustrate the previous behavior, consider this tag and its attribute's start and end positions (shown inclusively below): <meta charset="UTF-8" /> ^ name start ^ value start ^ value end ^ name end Rather than setting the end position of the attribute name when we parse the closing slash, ensure the end position is already set while we are in the AttributeName state. We now have: <meta charset="UTF-8" /> ^ name start ^ name end ^ value start ^ value end The tokenizer unit test has been extended to test these positions.	2023-08-25 08:22:24 +02:00
Ben Wiederhake	0184fc5e43	Everywhere: Use AK_MAKE_DEFAULT_MOVABLE to avoid mistakes	2023-06-18 08:47:51 +01:00
Luke Wilde	034aaf3f51	LibWeb: Introduce CustomElementRegistry and creating custom elements The main missing feature here is form associated custom elements.	2023-04-06 11:36:56 +02:00
Timothy Flynn	f3db548a3d	AK+Everywhere: Rename FlyString to DeprecatedFlyString DeprecatedFlyString relies heavily on DeprecatedString's StringImpl, so let's rename it to A) match the name of DeprecatedString, B) write a new FlyString class that is tied to String.	2023-01-09 23:00:24 +00:00
Linus Groh	57dc179b1f	Everywhere: Rename to_{string => deprecated_string}() where applicable This will make it easier to support both string types at the same time while we convert code, and tracking down remaining uses. One big exception is Value::to_string() in LibJS, where the name is dictated by the ToString AO.	2022-12-06 08:54:33 +01:00
Linus Groh	6e19ab2bbc	AK+Everywhere: Rename String to DeprecatedString We have a new, improved string type coming up in AK (OOM aware, no null state), and while it's going to use UTF-8, the name UTF8String is a mouthful - so let's free up the String name by renaming the existing class. Making the old one have an annoying name will hopefully also help with quick adoption :^)	2022-12-06 08:54:33 +01:00
Andreas Kling	97ca45d9c6	LibWeb: Store HTML tag name token data as FlyString while parsing This makes checking if a token is a specific tag O(1) instead of O(n).	2022-10-04 21:30:58 +02:00
Ben Wiederhake	32e98d0924	Libraries: Use AK::Variant default initialization where appropriate	2021-09-21 04:22:52 +04:30
TheFightingCatfish	08359ba578	LibWeb: Fix regression of "contenteditable" attribute	2021-07-31 17:39:28 +02:00
Max Wipfli	ccae0cae45	LibWeb: Rename HTMLToken::doctype_data() => ensure_doctype_data() This renames the accessor to better reflect what it does, as this will allocate a DoctypeData struct if there is none.	2021-07-17 16:24:57 +04:30
Max Wipfli	519a1cdc22	LibWeb: Change HTMLToken storage architecture This completely changes how HTMLTokens store their data. Previously, space was allocated for all token types separately. Now, the HTMLToken's data is stored in just a String, two booleans and a Variant. This change reduces sizeof(HTMLToken) from 68 to 32. Also, this reduces raw tokenization time by around 20 to 50 percent, depending on the page. Full document parsing time (with HTMLDocumentParser, on a local HTML page without any dependency files) is reduced by between 4 and 20 percent, depending on the page. Since tokenizing HTML pages can easily generated 50'000 tokens and more, the storage has been designed in a way that avoids heap allocations where possible, while trying to reduce the size of the tokens. The only tokens which need to allocate on the heap are thus DOCTYPE tokens (max. 1 per document), and tag tokens (but only if they have attributes). This way, only around 5 percent of all tokens generated need to allocate on the heap (except for StringImpl allocations).	2021-07-17 16:24:57 +04:30
Max Wipfli	8a4c44db8c	LibWeb: Make HTMLTokens non-copyable	2021-07-17 16:24:57 +04:30
Max Wipfli	2532bdfabf	LibWeb: Remove friend class declarations from HTMLToken Since all interaction with the HTMLToken class now happens over getters and setters, there is no more need for HTMLTokenizer and HTMLDocumentParser to have direct access to the members.	2021-07-17 16:24:57 +04:30
Max Wipfli	25cba4387b	LibWeb: Add HTMLToken(Type) constructor and use it	2021-07-17 16:24:57 +04:30
Max Wipfli	f2e3c770f9	LibWeb: Use setter for HTMLToken::m_{start,end}_position	2021-07-17 16:24:57 +04:30
Max Wipfli	8b31e41692	LibWeb: Change HTMLToken::m_doctype into named DoctypeData struct This is in preparation for an upcoming storage change of HTMLToken. In contrast to the other token types, the accessor can hand out a mutable reference to allow users to change parts of the DoctypeData easily.	2021-07-17 16:24:57 +04:30
Max Wipfli	918bde98b1	LibWeb: Hide implementation details of HTMLToken attribute list Previously, HTMLToken would expose the Vector<Attribute> directly to its users. In preparation for a future change, all users now use implementation-agnostic APIs which do not expose the Vector directly.	2021-07-17 16:24:57 +04:30
Max Wipfli	15d8635afc	LibWeb: User getter+setter for HTMLToken tag name and self-closing flag	2021-07-17 16:24:57 +04:30
Max Wipfli	1aeafcc58b	LibWeb: Use getter and setter for Character type HTMLTokens While storing the code point in a UTF-8 encoded String in horrendously inefficient, this problem will be addressed at a later stage.	2021-07-17 16:24:57 +04:30
Max Wipfli	e8e9426b4f	LibWeb: User getter and setter for Comment type HTMLTokens	2021-07-17 16:24:57 +04:30
Max Wipfli	f886aa15b8	LibWeb: Rename HTMLToken::AttributeBuilder struct to Attribute This does not contain StringBuilders anymore, so it can do with a simpler name: Attribute.	2021-07-17 16:24:57 +04:30
Max Wipfli	d82f3eb085	LibWeb: Make HTMLToken::{Position,AttributeBuilder} structs public There was and is no reason for those to be private. Making them public also allows us to explicitly specify the return type of some getters.	2021-07-17 16:24:57 +04:30
Max Wipfli	35f32ac170	LibWeb: Change HTMLToken.h to east const style	2021-07-14 23:03:36 +02:00
Gunnar Beutner	c3ad8e9a52	LibWeb: Remove StringBuilder from HTMLToken::m_comment_or_character	2021-07-14 23:03:36 +02:00
Gunnar Beutner	3aa202c432	LibWeb: Remove StringBuilder from HTMLToken::m_tag	2021-07-14 23:03:36 +02:00
Gunnar Beutner	901d71148b	LibWeb: Remove StringBuilders from HTMLToken::AttributeBuilder	2021-07-14 23:03:36 +02:00
Gunnar Beutner	992964aa7d	LibWeb: Remove StringBuilders from HTMLToken::m_doctype	2021-07-14 23:03:36 +02:00
Gunnar Beutner	2150609590	LibWeb: Remove more unused StringBuilders in HTMLToken These fields aren't read anywhere but I didn't feel like removing them outright.	2021-07-14 23:03:36 +02:00
Ali Mohammad Pur	aa7939bc6c	LibWeb: Add position tracking information to HTML tokens	2021-05-20 22:06:45 +02:00
Brian Gianforcaro	1682f0b760	Everything: Move to SPDX license identifiers in all files. SPDX License Identifiers are a more compact / standardized way of representing file license information. See: https://spdx.dev/resources/use/#identifiers This was done with the `ambr` search and replace tool. ambr --no-parent-ignore --key-from-file --rep-from-file key.txt rep.txt *	2021-04-22 11:22:27 +02:00
Andreas Kling	5d180d1f99	Everywhere: Rename ASSERT => VERIFY (...and ASSERT_NOT_REACHED => VERIFY_NOT_REACHED) Since all of these checks are done in release builds as well, let's rename them to VERIFY to prevent confusion, as everyone is used to assertions being compiled out in release. We can introduce a new ASSERT macro that is specifically for debug checks, but I'm doing this wholesale conversion first since we've accumulated thousands of these already, and it's not immediately obvious which ones are suitable for ASSERT.	2021-02-23 20:56:54 +01:00
Andreas Kling	13d7c09125	Libraries: Move to Userland/Libraries/	2021-01-12 12:17:46 +01:00

43 commits