beenull/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-22 23:50:19 +00:00

Author	SHA1	Message	Date
Tim Ledbetter	23eba28c22	Everywhere: Remove Serenity specific code from tests We no longer run our tests on Serenity.	2024-07-05 07:29:51 +02:00
Ali Mohammad Pur	5e1499d104	Everywhere: Rename {Deprecated => Byte}String This commit un-deprecates DeprecatedString, and repurposes it as a byte string. As the null state has already been removed, there are no other particularly hairy blockers in repurposing this type as a byte string (what it _really_ is). This commit is auto-generated: $ xs=$(ack -l \bDeprecatedString\b\|deprecated_string AK Userland \ Meta Ports Ladybird Tests Kernel) $ perl -pie 's/\bDeprecatedString\b/ByteString/g; s/deprecated_string/byte_string/g' $xs $ clang-format --style=file -i \ $(git diff --name-only | grep \.cpp\|\.h) $ gn format $(git ls-files '*.gn' '*.gni')	2023-12-17 18:25:10 +03:30
Shannon Booth	1f8d72da8e	LibWeb: Port HTMLToken::to_deprecated_string to new AK String	2023-11-06 11:37:08 +01:00
Shannon Booth	e4f8c59210	LibWeb: Port AttributeNames to FlyString	2023-10-08 08:11:48 -04:00
Timothy Flynn	5a2bf7fdd1	LibWeb: Set the correct end position of HTML attribute names We were previously setting the end position of attribute names in self- closing HTML tags to the end of the attribute value. To illustrate the previous behavior, consider this tag and its attribute's start and end positions (shown inclusively below): <meta charset="UTF-8" /> ^ name start ^ value start ^ value end ^ name end Rather than setting the end position of the attribute name when we parse the closing slash, ensure the end position is already set while we are in the AttributeName state. We now have: <meta charset="UTF-8" /> ^ name start ^ name end ^ value start ^ value end The tokenizer unit test has been extended to test these positions.	2023-08-25 08:22:24 +02:00
Timothy Flynn	5b2bc90b50	LibWeb: Set consistent positions for the start and end of HTML tags To illustrate the previous behavior, consider these tags and their start and end positions (shown inclusively below): Start tag: End tag: <span> </span> ^ start ^ start ^end ^end The start position of a tag is the first ASCII-alpha code point after the opening brace. The start position of a close tag is the slash just before the first ASCII-alpha code point. And the end position of both is the closing brace. So the opening brace is not included in the emitted tag, but the closing brace is. And the end tag including the slash is an oddity that had to be worked around in its only use case (syntax highlighting). We now consistently exclude the braces from the emitted tag, and also exclude the slash from the end tag, so that it does not need to be accounted for in syntax highlighting. That is, we now have: Start tag: End tag: <span> </span> ^ start ^ start ^end ^end The tokenizer unit test has been extended to test these positions.	2023-08-25 08:22:24 +02:00
Tim Schumacher	ae51c1821c	Everywhere: Remove unintentional partial stream reads and writes	2023-03-13 15:16:20 +00:00
Tim Schumacher	d5871f5717	AK: Rename Stream::{read,write} to Stream::{read_some,write_some} Similar to POSIX read, the basic read and write functions of AK::Stream do not have a lower limit of how much data they read or write (apart from "none at all"). Rename the functions to "read some [data]" and "write some [data]" (with "data" being omitted, since everything here is reading and writing data) to make them sufficiently distinct from the functions that ensure to use the entire buffer (which should be the go-to function for most usages). No functional changes, just a lot of new FIXMEs.	2023-03-13 15:16:20 +00:00
Tim Schumacher	874c7bba28	LibCore: Remove `Stream.h`	2023-02-13 00:50:07 +00:00
Tim Schumacher	606a3982f3	LibCore: Move Stream-based file into the `Core` namespace	2023-02-13 00:50:07 +00:00
Ben Wiederhake	0687a75eaa	LibWeb: Run tests in lagom if ENABLE_LAGOM_LIBWEB is set	2023-01-14 15:43:27 -07:00
Linus Groh	57dc179b1f	Everywhere: Rename to_{string => deprecated_string}() where applicable This will make it easier to support both string types at the same time while we convert code, and tracking down remaining uses. One big exception is Value::to_string() in LibJS, where the name is dictated by the ToString AO.	2022-12-06 08:54:33 +01:00
Linus Groh	6e19ab2bbc	AK+Everywhere: Rename String to DeprecatedString We have a new, improved string type coming up in AK (OOM aware, no null state), and while it's going to use UTF-8, the name UTF8String is a mouthful - so let's free up the String name by renaming the existing class. Making the old one have an annoying name will hopefully also help with quick adoption :^)	2022-12-06 08:54:33 +01:00
sin-ack	3f3f45580a	Everywhere: Add sv suffix to strings relying on StringView(char const*) Each of these strings would previously rely on StringView's char const* constructor overload, which would call __builtin_strlen on the string. Since we now have operator ""sv, we can replace these with much simpler versions. This opens the door to being able to remove StringView(char const*). No functional changes.	2022-07-12 23:11:35 +02:00
Sam Atkins	82605e2dff	Tests: Port TestHTMLTokenizer to Core::Stream	2022-03-10 12:04:22 -05:00
Adam Hodgen	b6eaefa87d	LibWeb: Fix 'Comment end state' in HTML Tokenizer Also, update the expected hash in the LibWeb TestHTMLTokenizer regression test. This is due to the "This comment has a few too many dashes." comment token being updated.	2022-02-21 16:31:45 +01:00
Karol Kosek	fb5e2670d6	LibWeb: Fix highlighting HTML comments Commit `b193351a99` caused the HTML comments to flash when changing the text cursor. Also, when double-clicking on a comment, the selection started from the beginning of the file instead. The following message was displaying when `TOKENIZER_TRACE_DEBUG` was enabled: (Tokenizer::nth_last_position) Invalid position requested: 4th-last of 4. Returning (0-0). Changing the `nth_last_position` to 3 fixes this. I'm guessing that's because the parser is at that moment on the second hyphen of the `<!--` string, so it has to go back only by three characters.	2022-02-14 12:50:44 +03:30
Timothy Flynn	7e63f0eb32	LibWeb: Update TestHTMLTokenizer's expected token hash The output of the tokenizer changed in commit: `b193351a99`.	2022-02-13 17:37:33 +00:00
Andreas Kling	8b1108e485	Everywhere: Pass AK::StringView by value	2021-11-11 01:27:46 +01:00
ovf	898b8ffcb6	LibWeb: Avoid assertion failure on parsing numeric character references	2021-07-28 18:32:22 +02:00
ovf	13c7d55320	LibWeb: Fix parsing of character references in attribute values	2021-07-27 00:03:43 +02:00
Max Wipfli	b6e995ca3c	Tests: Use pointers in TestHTMLTokenizer to avoid copying HTMLTokens	2021-07-17 16:24:57 +04:30
Max Wipfli	918bde98b1	LibWeb: Hide implementation details of HTMLToken attribute list Previously, HTMLToken would expose the Vector<Attribute> directly to its users. In preparation for a future change, all users now use implementation-agnostic APIs which do not expose the Vector directly.	2021-07-17 16:24:57 +04:30
Max Wipfli	2404ad6897	LibWeb: Fix assertion failure when tokenizing JS regex literals This fixes parsing the following regular expression: /</g; It also adds a simple script element to the HTMLTokenizer regression test, which also contains that specific regex.	2021-07-15 01:47:22 +02:00
Max Wipfli	a9a54914bf	Tests: Add comments to the HTMLTokenizer regression test file	2021-07-15 00:48:45 +02:00
Max Wipfli	5a44a0b9f4	Tests: Add a basic test suite for HTMLTokenizer The test suite includes a few basic tests and a very crude regression test, which just concatenates the to_string() of all tokens and checks the String's hash to be equal. This relies on the format of HTMLToken::to_string() to stay the same, which is not ideal.	2021-07-14 23:03:36 +02:00

26 commits