beenull/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-26 01:20:25 +00:00

Author	SHA1	Message	Date
Timothy Flynn	74d644a216	AK: Explicitly check for null data in Utf16View The underlying CPU-specific instructions for operating on UTF-16 strings behave differently for null inputs. Add an explicit check for this state for consistency.	2024-07-21 19:57:07 +02:00
Timothy Flynn	144452d638	AK: Explicitly check for null data in Utf8View The underlying CPU-specific instructions for operating on UTF-8 strings behave differently for null inputs. Add an explicit check for this state for consistency.	2024-07-21 19:57:07 +02:00
Timothy Flynn	71c29504af	AK: Support non-native endianness in Utf16View Utf16View currently assumes host endianness. Add support for specifying either big or little endianness (which we mostly just pipe through to simdutf). This will allow using simdutf facilities with LibTextCodec.	2024-07-18 19:43:57 +02:00
Timothy Flynn	0c14a9417a	AK: Replace converting to and from UTF-16 with simdutf The one behavior difference is that we will now actually fail on invalid code units with Utf16View::to_utf8(AllowInvalidCodeUnits::No). It was arguably a bug that this wasn't already the case.	2024-07-18 14:46:25 +02:00
Andrew Kaster	88044f59c6	AK: Stop exporting AK::FixedPoint into the global namespace This declaration has conflicts with the macOS SDK, which becomes a problem when trying to interact with system clang modules.	2024-07-18 09:43:38 +01:00
Andrew Kaster	bf600c8e1d	AK: Stop exporting AK::Duration into the global namespace This has conflicts with MacTypes.h from the Apple macOS SDKs, which becomes a huge problem when trying to interact with system clang modules	2024-07-18 09:43:38 +01:00
Timothy Flynn	bfc9dc447f	AK+LibWeb: Replace our home-grown base64 encoder/decoders with simdutf We currently have 2 base64 coders: one in AK, another in LibWeb for a "forgiving" implementation. ECMA-262 has an upcoming proposal which will require a third implementation. Instead, let's use the base64 implementation that is used by Node.js and recommended by the upcoming proposal. It handles forgiving decoding as well. Our users of AK's implementation should be fine with the forgiving implementation. The AK impl originally had naive forgiving behavior, but that was removed solely for performance reasons. Using http://mattmahoney.net/dc/enwik8.zip (100MB unzipped) as a test, performance of our old home-grown implementations vs. the simdutf implementation (on Linux x64), given as encode / decode times: AK base64 0.226s / 0.169s; LibWeb base64 N/A / 1.244s; simdutf 0.161s / 0.047s	2024-07-16 10:27:39 +02:00
Dennis Camera	b54a1c6284	AK: Implement ShortString for big-endian	2024-07-05 09:49:23 -06:00
Timothy Flynn	698a95d2de	AK: Decode paired UTF-16 surrogates in a JSON string For example, such use is seen on Twitter.	2024-07-04 14:16:16 +02:00
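
To illustrate what decoding a paired surrogate involves, here is a minimal sketch of the standard UTF-16 arithmetic that turns two escaped code units (such as `\uD83D\uDE00` in a JSON string) into a single code point. The helper names are hypothetical and are not AK's API.

```cpp
#include <cstdint>

// Hypothetical helpers for illustration only; not taken from AK.
constexpr bool is_high_surrogate(std::uint16_t unit) { return unit >= 0xD800 && unit <= 0xDBFF; }
constexpr bool is_low_surrogate(std::uint16_t unit) { return unit >= 0xDC00 && unit <= 0xDFFF; }

// Combine a high and a low surrogate into the code point they encode.
// The caller is expected to have checked both units with the predicates above.
constexpr std::uint32_t decode_surrogate_pair(std::uint16_t high, std::uint16_t low)
{
    return 0x10000u
        + ((static_cast<std::uint32_t>(high) - 0xD800u) << 10)
        + (static_cast<std::uint32_t>(low) - 0xDC00u);
}

// "\uD83D\uDE00" encodes U+1F600.
static_assert(decode_surrogate_pair(0xD83D, 0xDE00) == 0x1F600);
```
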
Zaggy1024	bbd8a218a5	AK: Prevent overflow of the min when clamping unsigned values to signed Also, add some tests for the cases that were broken before.	2024-06-24 12:41:32 -06:00
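
The kind of pitfall this commit describes can be sketched as follows. This is not the AK implementation; `clamp_to_sketch` is a hypothetical name, and the approach (handling negative inputs separately so the signed minimum is never converted to an unsigned type) is only one way to avoid the problem.

```cpp
#include <cstdint>
#include <limits>
#include <type_traits>

// Hypothetical sketch of a clamping integer conversion; not AK::clamp_to itself.
template<typename To, typename From>
constexpr To clamp_to_sketch(From value)
{
    static_assert(std::is_integral_v<To> && std::is_integral_v<From>);
    constexpr To lowest = std::numeric_limits<To>::min();
    constexpr To highest = std::numeric_limits<To>::max();

    if constexpr (std::is_signed_v<From>) {
        if (value < 0) {
            if constexpr (std::is_unsigned_v<To>)
                return 0;
            else
                // Both sides are signed here, so the minimum is compared without wrapping.
                return static_cast<std::intmax_t>(value) < static_cast<std::intmax_t>(lowest)
                    ? lowest
                    : static_cast<To>(value);
        }
    }

    // value is non-negative from here on, so only the upper bound can be exceeded.
    if (static_cast<std::uintmax_t>(value) > static_cast<std::uintmax_t>(highest))
        return highest;
    return static_cast<To>(value);
}

static_assert(clamp_to_sketch<std::int32_t>(std::uint32_t { 4000000000u }) == 2147483647);
static_assert(clamp_to_sketch<std::int8_t>(-1000) == -128);
```
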
Zaggy1024	172f4588a7	Tests/AK: Add some quick tests for AK::clamp_to	2024-06-24 12:41:32 -06:00
Timothy Flynn	5cf818e305	LibUnicode: Replace case transformations and comparison with ICUs There are a couple of differences here due to using ICU: 1. Titlecasing behaves slightly differently. We previously transformed "123dollars" to "123Dollars", as we would use word segmentation to split a string into words, then transform the first cased character to titlecase. ICU doesn't go quite that far, and leaves the string as "123dollars". While this is a behavior change, the only user of this API is the `text-transform: capitalize;` CSS rule, and we now match the behavior of other browsers. 2. There isn't an API to compare strings with case insensitivity without allocating case-folded strings for both the left- and right-hand-side strings. Our implementation was previously allocation-free; however, in a benchmark, ICU is still ~1.4x faster.	2024-06-20 10:59:55 +02:00
Andreas Kling	b88e0eb50a	AK: Remove unused Complex.h	2024-06-18 12:00:14 +02:00
Andreas Kling	fe1aec124e	AK: Remove unused ArbitrarySizedEnum class	2024-06-18 12:00:14 +02:00
Diego	7560b640f3	AK: Add `AllowSurrogates` to UTF-8 validator The [UTF-8](https://datatracker.ietf.org/doc/html/rfc3629#page-5) standard says to reject strings with upper or lower surrogates. However, in many standards, ECMAScript included, unpaired surrogates (and therefore UTF-8 surrogates) are allowed in strings. So, this commit extends the UTF-8 validation API with `AllowSurrogates`, which controls whether upper and lower surrogate characters are rejected.	2024-06-09 12:16:32 +02:00
Daniel Bertalan	376b956214	Tests: Stop invoking UB in `AK::NeverDestroyed`'s tests Instead of attempting a stack use-after-free by reading an out-of-scope object's data member, let's keep a flag that checks if the destructor had been called in the outer scope. Fixes #64	2024-06-05 17:19:14 -06:00
Andreas Kling	6321e97b09	AK: Remove various unused things	2024-06-04 09:19:39 +02:00
Timothy Flynn	fe3fde2411	AK+LibUnicode: Implement a case-insensitive variant of find_byte_offset The existing String::find_byte_offset is case-sensitive. This variant allows performing searches using Unicode-aware case folding.	2024-06-01 07:37:54 +02:00
Tim Ledbetter	817bfef3aa	Tests/AK: Add tests for integral log2	2024-05-21 09:31:17 +02:00
Tim Ledbetter	d0d81e470e	AK: Fix off by one error in integral `ceil_log2()` Previously, certain values of `ceil_log2(x)` would be 1 smaller than `ceil(log2(x))`.	2024-05-21 09:31:17 +02:00
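
As a reference point for the property stated above (the result should equal `ceil(log2(x))`), here is a small integer-only sketch. `ceil_log2_sketch` is a hypothetical name and this is not necessarily how AK computes it; the input is assumed to be greater than zero.

```cpp
#include <cstdint>

// Hypothetical sketch; assumes x > 0.
constexpr unsigned ceil_log2_sketch(std::uint64_t x)
{
    // floor(log2(x)): count how many times x can be halved before reaching 1.
    unsigned floor_log = 0;
    for (std::uint64_t v = x; v > 1; v >>= 1)
        ++floor_log;

    // Exact powers of two need no rounding up; everything else does.
    bool const is_power_of_two = (x & (x - 1)) == 0;
    return is_power_of_two ? floor_log : floor_log + 1;
}

static_assert(ceil_log2_sketch(1) == 0);
static_assert(ceil_log2_sketch(4) == 2);
static_assert(ceil_log2_sketch(5) == 3);
```
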
Abuneri	b5bed37074	AK: Replace FP math in `is_power_of` with a purely integral algorithm The previous naive approach was causing test failures because of rounding issues in some exotic environments. In particular, MSVC via MSBuild	2024-05-07 16:43:34 -06:00
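
A purely integral check of this kind might look like the sketch below, which repeatedly divides by the base instead of calling floating-point `log` or `pow`. The name and signature are hypothetical; the actual AK algorithm may differ.

```cpp
// Hypothetical sketch; not the AK signature.
template<unsigned long long Base>
constexpr bool is_power_of_sketch(unsigned long long x)
{
    static_assert(Base >= 2, "the base must be at least 2");
    if (x == 0)
        return false;
    // Strip factors of Base; a power of Base reduces to exactly 1.
    while (x % Base == 0)
        x /= Base;
    return x == 1;
}

static_assert(is_power_of_sketch<3>(81));
static_assert(!is_power_of_sketch<3>(82));
```

Because no rounding is involved, the result does not depend on the floating-point behavior of the compiler or platform.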
Nico Weber	88d0702763	AK: Make ceil_div() handle one argument being negative correctly `ceil_div(-1, 2)` used to return -1. Now it returns 0, which is the correct ceil(-0.5). (C++'s division semantics have floor semantics for numbers > 0, but ceil semantics for numbers < 0.) This will be important for the JPEG2000 decoder eventually.	2024-04-27 07:09:08 +02:00
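
The rounding rule described in this commit can be sketched as follows: C++ integer division truncates toward zero, so the quotient only needs to be bumped up when there is a remainder and the exact result is positive. `ceil_div_sketch` is a hypothetical name, not the AK function.

```cpp
// Hypothetical sketch for signed integer types; b must not be zero.
template<typename T>
constexpr T ceil_div_sketch(T a, T b)
{
    T quotient = a / b; // Truncates toward zero.
    // With a remainder and matching signs, the exact result is positive and
    // truncation rounded it down, so round up. With differing signs,
    // truncation already rounded toward zero, which is the ceiling.
    if (a % b != 0 && ((a < 0) == (b < 0)))
        ++quotient;
    return quotient;
}

static_assert(ceil_div_sketch(-1, 2) == 0);
static_assert(ceil_div_sketch(1, 2) == 1);
```
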
Nico Weber	f2ebad11a8	Tests/AK: Add some basic ceil_div() tests	2024-04-27 07:09:08 +02:00
Timothy Flynn	ec492a1a08	Everywhere: Run clang-format The following command was used to clang-format these files: clang-format-18 -i $(find . -not \( -path "./\.*" -prune \) -not \( -path "./Base/*" -prune \) -not \( -path "./Build/*" -prune \) -not \( -path "./Toolchain/*" -prune \) -not \( -path "./Ports/*" -prune \) -type f -name "*.cpp" -o -name "*.mm" -o -name "*.h") There are a couple of weird cases where clang-format now thinks that a pointer access in an initializer list, e.g. `m_member(ptr->foo)`, is a lambda return statement, and it puts spaces around the `->`.	2024-04-24 16:50:01 -04:00
dgaston	08aaf4fb07	AK: Add methods to BufferedStream to resize the user supplied buffer These changes allow lines of arbitrary length to be read with BufferedStream. When the user supplied buffer is smaller than the line, it will be resized to fit the line. When the internal buffer in BufferedStream is smaller than the line, it will be read into the user supplied buffer chunk by chunk with the buffer growing accordingly. Other behaviors match the behavior of the existing read_line method.	2024-04-21 11:46:55 +02:00
Hendiadyoin1	f95abe8c0e	AK: Make BigIntBase more agnostic to non native word sizes This will allow us to use it in Crypto::UnsignedBigInteger, which always uses 32 bit words	2024-03-25 14:26:29 -06:00
Timothy Flynn	7e38653492	AK: Reject invalid Base64 encoded string lengths	2024-03-25 08:13:27 +01:00
Timothy Flynn	754ff41b9c	AK: Remove whitespace skipping feature from AK's Base64 decoder This was added in commit `f2663f477f` as a partial implementation of what is now LibWeb's forgiving Base64 decoder. All use cases within LibWeb that require whitespace skipping now use that implementation instead. Removing this feature from AK allows us to know the exact output size of a decoded Base64 string. We can still trim whitespace at the start and end of the input though; for example, this is useful when reading from a file that may have a newline at the end of the file.	2024-03-25 08:13:27 +01:00
Dan Klishch	45a0ba2167	AK: Introduce AK::enumerate Co-Authored-By: Tim Flynn <trflynn89@pm.me>	2024-03-23 09:02:58 -04:00
Andrew Kaster	e9b16970fe	AK: Add base64url encoding and decoding methods This encoding scheme comes from section 5 of RFC 4648, as an alternative to the standard base64 encode/decode methods. The only difference is that the last two characters are replaced with '-' and '_', as '+' and '/' are not safe in URLs or filenames.	2024-03-20 12:18:57 -04:00
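
To make the alphabet difference concrete, here is a minimal encoder sketch using the URL-safe alphabet from section 5 of RFC 4648. The function name is hypothetical and this is not AK's code; in particular, keeping the `=` padding is an assumption of this sketch, since the RFC allows it to be omitted in some contexts.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <string_view>

// Hypothetical sketch; not the AK implementation.
std::string base64url_encode_sketch(std::string_view input)
{
    // Same as the standard alphabet, except '+' and '/' become '-' and '_'.
    static constexpr char alphabet[] = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789-_";

    std::string output;
    for (std::size_t i = 0; i < input.size(); i += 3) {
        std::size_t const remaining = input.size() - i;

        // Pack up to three bytes into a 24-bit chunk.
        std::uint32_t chunk = static_cast<std::uint32_t>(static_cast<unsigned char>(input[i])) << 16;
        if (remaining > 1)
            chunk |= static_cast<std::uint32_t>(static_cast<unsigned char>(input[i + 1])) << 8;
        if (remaining > 2)
            chunk |= static_cast<std::uint32_t>(static_cast<unsigned char>(input[i + 2]));

        // Emit four 6-bit groups, padding with '=' when input bytes ran out.
        output += alphabet[(chunk >> 18) & 0x3F];
        output += alphabet[(chunk >> 12) & 0x3F];
        output += remaining > 1 ? alphabet[(chunk >> 6) & 0x3F] : '=';
        output += remaining > 2 ? alphabet[chunk & 0x3F] : '=';
    }
    return output;
}
```

For example, the two bytes 0xFB 0xFF encode to `+/8=` in standard base64 and to `-_8=` with this alphabet.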
Shannon Booth	e800605ad3	AK+LibURL: Move AK::URL into a new URL library This URL library ends up being a relatively fundamental base library of the system, as LibCore depends on LibURL. This change has two main benefits: * Moving AK back more towards being an agnostic library that can be used between the kernel and userspace. URL has never really fit that description - and is not used in the kernel. * URL _should_ depend on LibUnicode, as it needs punycode support. However, it's not really possible to do this inside of AK as it can't depend on any external library. This change brings us a little closer to being able to do that, but unfortunately we aren't there quite yet, as the code generators depend on LibCore.	2024-03-18 14:06:28 -04:00
Timothy Flynn	e4213f5767	AK: Generalize Span::contains_slow to use the Traits infrastructure This allows, for example, checking if a Span<String> contains a value without having to allocate a String.	2024-03-16 08:42:33 +01:00
Timothy Flynn	e3b5e24ce0	AK: Iterate the bytes of a URL query with an unsigned type Otherwise, we percent-encode negative signed chars incorrectly. For example, https://www.strava.com/login contains the following hidden <input> field: <input name="utf8" type="hidden" value="✓" /> On submitting the form, we would percent-encode that field as: utf8=%-1E%-64%-6D Which would cause us to receive an HTTP 500 response. We now properly percent-encode that field as: utf8=%E2%9C%93 And can login to Strava :^)	2024-03-10 15:17:31 +01:00
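
The failure mode described here can be reproduced in a few lines. The sketch below iterates the input as `unsigned char`, so the byte 0xE2 is seen as 226 rather than -30 (which is where the `%-1E` in the broken output came from). The function name is hypothetical, and the set of characters left unencoded is a simplification chosen for this example, not the URL specification's query percent-encode set.

```cpp
#include <string>
#include <string_view>

// Hypothetical sketch; not the AK or LibURL implementation.
std::string percent_encode_sketch(std::string_view input)
{
    static constexpr char hex_digits[] = "0123456789ABCDEF";

    std::string output;
    // Iterating as unsigned char keeps bytes >= 0x80 in the range 128..255.
    for (unsigned char byte : input) {
        bool const is_unreserved = (byte >= 'A' && byte <= 'Z') || (byte >= 'a' && byte <= 'z')
            || (byte >= '0' && byte <= '9') || byte == '-' || byte == '.' || byte == '_' || byte == '~';
        if (is_unreserved) {
            output += static_cast<char>(byte);
        } else {
            output += '%';
            output += hex_digits[byte >> 4];
            output += hex_digits[byte & 0x0F];
        }
    }
    return output;
}
```

With this, the UTF-8 bytes of "✓" (0xE2 0x9C 0x93) encode to `%E2%9C%93`, matching the corrected output quoted in the commit message.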
Timothy Flynn	82ea53cf10	AK: Add a StringView method to count the number of lines in a string We already have a helper to split a StringView by line while considering "\n", "\r", and "\r\n". Add an analogous method to just count the number of lines in the same manner.	2024-03-08 14:43:33 -05:00
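
A line counter that treats "\n", "\r", and "\r\n" each as a single line break could be sketched like this. The name is hypothetical, and the conventions for edge cases (an empty string counts as zero lines, a trailing line break starts a new empty line) are assumptions of this sketch rather than documented AK behavior.

```cpp
#include <cstddef>
#include <string_view>

// Hypothetical sketch; edge-case conventions are assumptions, not AK's.
constexpr std::size_t count_lines_sketch(std::string_view text)
{
    if (text.empty())
        return 0;

    std::size_t lines = 1;
    for (std::size_t i = 0; i < text.size(); ++i) {
        if (text[i] == '\r') {
            // Treat "\r\n" as one line break by skipping the '\n'.
            if (i + 1 < text.size() && text[i + 1] == '\n')
                ++i;
            ++lines;
        } else if (text[i] == '\n') {
            ++lines;
        }
    }
    return lines;
}
```
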
Andrew Kaster	21ac431fac	AK: Allow reading from EOF buffered streams better in read_line() If the BufferedStream is able to fill its entire circular buffer in populate_read_buffer() and is later asked to read a line or read until a delimiter, it could erroneously return EMSGSIZE if the caller's buffer was smaller than the internal buffer. In this case, all we really care about is whether the caller's buffer is big enough for however much data we're going to copy into it. Which needs to take into account the candidate.	2024-02-26 13:16:27 -07:00
Nico Weber	986872800e	Tests/AK: Add a test for the array ctor deduction guide	2024-02-11 18:53:00 +01:00
Nico Weber	d84b69ace9	AK: Add to_array() This is useful if you want an array with an explicit type but still want its size to be inferred.	2024-02-11 18:53:00 +01:00
Nico Weber	4409b33145	AK: Make IndexSequence use size_t This makes it possible to use MakeIndexSequence in functions like: template<typename T, size_t N> constexpr auto foo(T (&a)[N]) This means AK/StdLibExtraDetails.h must now include AK/Types.h for size_t, which means AK/Types.h can no longer include AK/StdLibExtras.h (which arguably it shouldn't do anyways), which requires rejiggering some things. (IMHO Types.h shouldn't use AK::Details metaprogramming at all. FlatPtr doesn't necessarily have to use Conditional<> and ssize_t could maybe be in its own header or something. But since it's tangential to this PR, going with the tried and true "lift things that cause the cycle up to the top" approach.)	2024-02-11 18:53:00 +01:00
vincent-rg	a9df60ff1c	AK: Update OptionParser::m_arg_index by subtracting skipped args On argument swapping to put positional ones toward the end, m_arg_index was pointing at "last arg index" + "skipped args" + "consumed args" and thus was pointing ahead of the skipped ones. m_arg_index now points after the current parsed option arguments.	2024-02-06 00:08:30 +01:00
Dan Klishch	b5f1a48a7c	AK+Everywhere: Remove JsonValue APIs with implicit default values	2024-01-21 15:47:53 -07:00
Dan Klishch	faef802229	AK+GMLCompiler: Remove JsonValue::as_double() Replace its single (non-test) usage with newly created as_number(), which does not leak information about internal integer storage type.	2024-01-21 15:47:53 -07:00
Dan Klishch	5230d2af91	AK+WebContent: Remove JsonValue::as_{i,u}{32,64}()	2024-01-21 15:47:53 -07:00
Tim Ledbetter	65827826fe	AK: Add `CharacterTypes::is_ascii_base36_digit()` This can be used to validate the string passed to `parse_ascii_base36_digit()`.	2024-01-13 19:01:35 -07:00
Tim Ledbetter	bbdbd71439	Tests/AK: Add unit test for Base36 digit parsing	2024-01-13 19:01:35 -07:00
Dan Klishch	ccd701809f	Everywhere: Add deprecated_ prefix to `JsonValue::to_byte_string` `JsonValue::to_byte_string` has peculiar type-erasure semantics which is not usually intended. Unfortunately, it also has a very stereotypical name which does not warn about unexpected behavior. So let's prefix it with `deprecated_` to make new code use `as_string` if it just wants to get string value or `serialized<StringBuilder>` if it needs to do proper serialization.	2024-01-12 17:41:34 -07:00
kleines Filmröllchen	eada4f2ee8	AK: Remove ByteString from GenericLexer A bunch of users used consume_specific with a constant ByteString literal, which can be replaced by an allocation-free StringView literal. The generic consume_while overload gains a requires clause so that consume_specific("abc") causes a more understandable and actionable error.	2024-01-12 17:03:53 -07:00
Martin Janiczek	5a8781393a	AK: Cover TestComplex with more tests Related: - video detailing the process of writing these tests: https://www.youtube.com/watch?v=enxglLlALvI - PR fixing bugs the above effort found: https://github.com/SerenityOS/serenity/pull/22025	2024-01-12 16:42:51 -07:00
Martin Janiczek	d52ffcd830	LibTest: Add more numeric generators Rename unsigned_int generator to number_u32. Add generators: - number_u64 - number_f64 - percentage	2024-01-12 16:42:51 -07:00
Timothy Flynn	8064c9fc4d	AK: Add unit tests for StringUtils::find_last This method was added without tests. Add some now to ensure future changes do not break it.	2024-01-04 11:28:03 -05:00
Timothy Flynn	1b4a23095c	AK: Add a Utf16View::starts_with method Based heavily on Utf8View::starts_with.	2024-01-04 12:43:10 +01:00


465 commits