beenull/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-22 15:40:19 +00:00

Author	SHA1	Message	Date
Dennis Camera	b54a1c6284	AK: Implement ShortString for big-endian	2024-07-05 09:49:23 -06:00
Timothy Flynn	698a95d2de	AK: Decode paired UTF-16 surrogates in a JSON string For example, such use is seen on Twitter.	2024-07-04 14:16:16 +02:00
Zaggy1024	bbd8a218a5	AK: Prevent overflow of the min when clamping unsigned values to signed Also, add some tests for the cases that were broken before.	2024-06-24 12:41:32 -06:00
Zaggy1024	172f4588a7	Tests/AK: Add some quick tests for AK::clamp_to	2024-06-24 12:41:32 -06:00
Timothy Flynn	5cf818e305	LibUnicode: Replace case transformations and comparison with ICUs There are a couple of differences here due to using ICU: 1. Titlecasing behaves slightly differently. We previously transformed "123dollars" to "123Dollars", as we would use word segmentation to split a string into words, then transform the first cased character to titlecase. ICU doesn't go quite that far, and leaves the string as "123dollars". While this is a behavior change, the only user of this API is the `text-transform: capitalize;` CSS rule, and we now match the behavior of other browsers. 2. There isn't an API to compare strings with case insensitivity without allocating case-folded strings for both the left- and right-hand-side strings. Our implementation was previously allocation-free; however, in a benchmark, ICU is still ~1.4x faster.	2024-06-20 10:59:55 +02:00
Andreas Kling	b88e0eb50a	AK: Remove unused Complex.h	2024-06-18 12:00:14 +02:00
Andreas Kling	fe1aec124e	AK: Remove unused ArbitrarySizedEnum class	2024-06-18 12:00:14 +02:00
Diego	7560b640f3	AK: Add `AllowSurrogates` to UTF-8 validator The [UTF-8](https://datatracker.ietf.org/doc/html/rfc3629#page-5) standard says to reject strings with upper or lower surrogates. However, in many standards, ECMAScript included, unpaired surrogates (and therefore UTF-8 surrogates) are allowed in strings. So, this commit extends the UTF-8 validation API with `AllowSurrogates`, which will reject upper and lower surrogate characters.	2024-06-09 12:16:32 +02:00
Daniel Bertalan	376b956214	Tests: Stop invoking UB in `AK::NeverDestroyed`'s tests Instead of attempting a stack use-after-free by reading an out-of-scope object's data member, let's keep a flag that checks if the destructor had been called in the outer scope. Fixes #64	2024-06-05 17:19:14 -06:00
Andreas Kling	6321e97b09	AK: Remove various unused things	2024-06-04 09:19:39 +02:00
Timothy Flynn	fe3fde2411	AK+LibUnicode: Implement a case-insensitive variant of find_byte_offset The existing String::find_byte_offset is case-sensitive. This variant allows performing searches using Unicode-aware case folding.	2024-06-01 07:37:54 +02:00
Tim Ledbetter	817bfef3aa	Tests/AK: Add tests for integral log2	2024-05-21 09:31:17 +02:00
Tim Ledbetter	d0d81e470e	AK: Fix off by one error in integral `ceil_log2()` Previously, certain values of `ceil_log2(x)` would be 1 smaller than `ceil(log2(x))`.	2024-05-21 09:31:17 +02:00
Abuneri	b5bed37074	AK: Replace FP math in `is_power_of` with a purely integral algorithm The previous naive approach was causing test failures because of rounding issues in some exotic environments. In particular, MSVC via MSBuild	2024-05-07 16:43:34 -06:00
Nico Weber	88d0702763	AK: Make ceil_div() handle one argument being negative correctly `ceil_div(-1, 2)` used to return -1. Now it returns 0, which is the correct ceil(-0.5). (C++'s division semantics have floor semantics for numbers > 0, but ceil semantics for numbers < 0.) This will be important for the JPEG2000 decoder eventually.	2024-04-27 07:09:08 +02:00
Nico Weber	f2ebad11a8	Tests/AK: Add some basic ceil_div() tests	2024-04-27 07:09:08 +02:00
Timothy Flynn	ec492a1a08	Everywhere: Run clang-format The following command was used to clang-format these files: clang-format-18 -i $(find . \ -not $ -path "./\." -prune $ \ -not $ -path "./Base/" -prune $ \ -not $ -path "./Build/" -prune $ \ -not $ -path "./Toolchain/" -prune $ \ -not $ -path "./Ports/" -prune $ \ -type f -name ".cpp" -o -name ".mm" -o -name ".h") There are a couple of weird cases where clang-format now thinks that a pointer access in an initializer list, e.g. `m_member(ptr->foo)`, is a lambda return statement, and it puts spaces around the `->`.	2024-04-24 16:50:01 -04:00
dgaston	08aaf4fb07	AK: Add methods to BufferedStream to resize the user supplied buffer These changes allow lines of arbitrary length to be read with BufferedStream. When the user supplied buffer is smaller than the line, it will be resized to fit the line. When the internal buffer in BufferedStream is smaller than the line, it will be read into the user supplied buffer chunk by chunk with the buffer growing accordingly. Other behaviors match the behavior of the existing read_line method.	2024-04-21 11:46:55 +02:00
Hendiadyoin1	f95abe8c0e	AK: Make BigIntBase more agnostic to non native word sizes This will allow us to use it in Crypto::UnsignedBigInteger, which always uses 32 bit words	2024-03-25 14:26:29 -06:00
Timothy Flynn	7e38653492	AK: Reject invalid Base64 encoded string lengths	2024-03-25 08:13:27 +01:00
Timothy Flynn	754ff41b9c	AK: Remove whitespace skipping feature from AK's Base64 decoder This was added in commit `f2663f477f` as a partial implementation of what is now LibWeb's forgiving Base64 decoder. All use cases within LibWeb that require whitespace skipping now use that implementation instead. Removing this feature from AK allows us to know the exact output size of a decoded Base64 string. We can still trim whitespace at the start and end of the input though; for example, this is useful when reading from a file that may have a newline at the end of the file.	2024-03-25 08:13:27 +01:00
Dan Klishch	45a0ba2167	AK: Introduce AK::enumerate Co-Authored-By: Tim Flynn <trflynn89@pm.me>	2024-03-23 09:02:58 -04:00
Andrew Kaster	e9b16970fe	AK: Add base64url encoding and decoding methods This encoding scheme comes from section 5 of RFC 4648, as an alternative to the standard base64 encode/decode methods. The only difference is that the last two characters are replaced with '-' and '_', as '+' and '/' are not safe in URLs or filenames.	2024-03-20 12:18:57 -04:00
Shannon Booth	e800605ad3	AK+LibURL: Move AK::URL into a new URL library This URL library ends up being a relatively fundamental base library of the system, as LibCore depends on LibURL. This change has two main benefits: * Moving AK back more towards being an agnostic library that can be used between the kernel and userspace. URL has never really fit that description - and is not used in the kernel. * URL _should_ depend on LibUnicode, as it needs punnycode support. However, it's not really possible to do this inside of AK as it can't depend on any external library. This change brings us a little closer to being able to do that, but unfortunately we aren't there quite yet, as the code generators depend on LibCore.	2024-03-18 14:06:28 -04:00
Timothy Flynn	e4213f5767	AK: Generalize Span::contains_slow to use the Traits infrastructure This allows, for example, checking if a Span<String> contains a value without having to allocate a String.	2024-03-16 08:42:33 +01:00
Timothy Flynn	e3b5e24ce0	AK: Iterate the bytes of a URL query with an unsigned type Otherwise, we percent-encode negative signed chars incorrectly. For example, https://www.strava.com/login contains the following hidden <input> field: <input name="utf8" type="hidden" value="✓" /> On submitting the form, we would percent-encode that field as: utf8=%-1E%-64%-6D Which would cause us to receive an HTTP 500 response. We now properly percent-encode that field as: utf8=%E2%9C%93 And can login to Strava :^)	2024-03-10 15:17:31 +01:00
Timothy Flynn	82ea53cf10	AK: Add a StringView method to count the number of lines in a string We already have a helper to split a StringView by line while considering "\n", "\r", and "\r\n". Add an analagous method to just count the number of lines in the same manner.	2024-03-08 14:43:33 -05:00
Andrew Kaster	21ac431fac	AK: Allow reading from EOF buffered streams better in read_line() If the BufferedStream is able to fill its entire circular buffer in populate_read_buffer() and is later asked to read a line or read until a delimiter, it could erroneously return EMSGSIZE if the caller's buffer was smaller than the internal buffer. In this case, all we really care about is whether the caller's buffer is big enough for however much data we're going to copy into it. Which needs to take into account the candidate.	2024-02-26 13:16:27 -07:00
Nico Weber	986872800e	Tests/AK: Add a test for the array ctor deduction guide	2024-02-11 18:53:00 +01:00
Nico Weber	d84b69ace9	AK: Add to_array() This is useful if you want an array with an explicit type but still want its size to be inferred.	2024-02-11 18:53:00 +01:00
Nico Weber	4409b33145	AK: Make IndexSequence use size_t This makes it possible to use MakeIndexSequqnce in functions like: template<typename T, size_t N> constexpr auto foo(T (&a)[N]) This means AK/StdLibExtraDetails.h must now include AK/Types.h for size_t, which means AK/Types.h can no longer include AK/StdLibExtras.h (which arguably it shouldn't do anyways), which requires rejiggering some things. (IMHO Types.h shouldn't use AK::Details metaprogramming at all. FlatPtr doesn't necessarily have to use Conditional<> and ssize_t could maybe be in its own header or something. But since it's tangential to this PR, going with the tried and true "lift things that cause the cycle up to the top" approach.)	2024-02-11 18:53:00 +01:00
vincent-rg	a9df60ff1c	AK: Update OptionParser::m_arg_index by substracting skipped args On argument swapping to put positional ones toward the end, m_arg_index was pointing at "last arg index" + "skipped args" + "consumed args" and thus was pointing ahead of the skipped ones. m_arg_index now points after the current parsed option arguments.	2024-02-06 00:08:30 +01:00
Dan Klishch	b5f1a48a7c	AK+Everywhere: Remove JsonValue APIs with implicit default values	2024-01-21 15:47:53 -07:00
Dan Klishch	faef802229	AK+GMLCompiler: Remove JsonValue::as_double() Replace its single (non-test) usage with newly created as_number(), which does not leak information about internal integer storage type.	2024-01-21 15:47:53 -07:00
Dan Klishch	5230d2af91	AK+WebContent: Remove JsonValue::as_{i,u}{32,64}()	2024-01-21 15:47:53 -07:00
Tim Ledbetter	65827826fe	AK: Add `CharacterTypes::is_ascii_base36_digit()` This can be used to validate the string passed to `parse_ascii_base36_digit()`.	2024-01-13 19:01:35 -07:00
Tim Ledbetter	bbdbd71439	Tests/AK: Add unit test for Base36 digit parsing	2024-01-13 19:01:35 -07:00
Dan Klishch	ccd701809f	Everywhere: Add deprecated_ prefix to `JsonValue::to_byte_string` `JsonValue::to_byte_string` has peculiar type-erasure semantics which is not usually intended. Unfortunately, it also has a very stereotypical name which does not warn about unexpected behavior. So let's prefix it with `deprecated_` to make new code use `as_string` if it just wants to get string value or `serialized<StringBuilder>` if it needs to do proper serialization.	2024-01-12 17:41:34 -07:00
kleines Filmröllchen	eada4f2ee8	AK: Remove ByteString from GenericLexer A bunch of users used consume_specific with a constant ByteString literal, which can be replaced by an allocation-free StringView literal. The generic consume_while overload gains a requires clause so that consume_specific("abc") causes a more understandable and actionable error.	2024-01-12 17:03:53 -07:00
Martin Janiczek	5a8781393a	AK: Cover TestComplex with more tests Related: - video detailing the process of writing these tests: https://www.youtube.com/watch?v=enxglLlALvI - PR fixing bugs the above effort found: https://github.com/SerenityOS/serenity/pull/22025	2024-01-12 16:42:51 -07:00
Martin Janiczek	d52ffcd830	LibTest: Add more numeric generators Rename unsigned_int generator to number_u32. Add generators: - number_u64 - number_f64 - percentage	2024-01-12 16:42:51 -07:00
Timothy Flynn	8064c9fc4d	AK: Add unit tests for StringUtils::find_last This method was added without tests. Add some now to ensure future changes do not break it.	2024-01-04 11:28:03 -05:00
Timothy Flynn	1b4a23095c	AK: Add a Utf16View::starts_with method Based heavily on Utf8View::starts_with.	2024-01-04 12:43:10 +01:00
Timothy Flynn	c46ba7e68d	AK: Allow constructing a UTF-16 view from a UTF-16 string literal UTF-16 string literals are a language-level feature. It is convenient to be able to construct a Utf16View from these strings.	2024-01-04 12:43:10 +01:00
Aliaksandr Kalenik	e394971209	AK+LibWeb: Use segmented vector to store commands in RecordingPainter Using a vector to represent a list of painting commands results in many reallocations, especially on pages with a lot of content. This change addresses it by introducing a SegmentedVector, which allows fast appending by representing a list as a sequence of fixed-size vectors. Currently, this new data structure supports only the operations used in RecordingPainter, which are appending and iterating.	2023-12-30 23:02:46 +01:00
Timothy Flynn	507a5d8a07	AK: Add an option to zero-fill ByteBuffer data upon growth This is to avoid UB in cases where we need to be able to read from the buffer immediately after resizing it.	2023-12-27 19:30:39 +01:00
Shannon Booth	e2e7c4d574	Everywhere: Use to_number<T> instead of to_{int,uint,float,double} In a bunch of cases, this actually ends up simplifying the code as to_number will handle something such as: ``` Optional<I> opt; if constexpr (IsSigned<I>) opt = view.to_int<I>(); else opt = view.to_uint<I>(); ``` For us. The main goal here however is to have a single generic number conversion API between all of the String classes.	2023-12-23 20:41:07 +01:00
Jesús "gsus" Lapastora	7578620f25	AK/StringUtils: Ensure needle positions don't overlap in replace Previously, `replace` used `find_all` to find all of the positions to replace. But `find_all` finds all the overlapping instances of the needle, while `replace` assumed that the next position was always at least `needle.length()` away from the last one. This led to crashes like https://github.com/SerenityOS/jakt/issues/1159.	2023-12-17 12:00:48 -07:00
Ali Mohammad Pur	5e1499d104	Everywhere: Rename {Deprecated => Byte}String This commit un-deprecates DeprecatedString, and repurposes it as a byte string. As the null state has already been removed, there are no other particularly hairy blockers in repurposing this type as a byte string (what it _really_ is). This commit is auto-generated: $ xs=$(ack -l \bDeprecatedString\b\\|deprecated_string AK Userland \ Meta Ports Ladybird Tests Kernel) $ perl -pie 's/\bDeprecatedString\b/ByteString/g; s/deprecated_string/byte_string/g' $xs $ clang-format --style=file -i \ $(git diff --name-only \| grep \.cpp\\|\.h) $ gn format $(git ls-files '.gn' '.gni')	2023-12-17 18:25:10 +03:30
Tim Schumacher	cb03d3d78f	AK: Allow rejecting BitStream reads beyond EOF	2023-12-01 12:48:18 +01:00

1 2 3 4 5 ...

458 commits