beenull/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-22 07:30:19 +00:00

Author	SHA1	Message	Date
Andreas Kling	a34a5af939	LibGfx: Remove ILBM image format support This format is not supported by other browsers.	2024-06-17 21:57:35 +02:00
Andreas Kling	b6daddb088	LibGfx: Remove JBIG2 image format support This format is not supported by other browsers.	2024-06-17 21:57:35 +02:00
Andreas Kling	681a2ac14e	LibGfx: Remove support for the various "portable" image formats These formats are not supported by other browsers.	2024-06-17 21:57:35 +02:00
Andreas Kling	7141319a7c	LibGfx: Remove DDS image format support This format is not supported by other browsers.	2024-06-17 21:57:35 +02:00
Andreas Kling	2a888ca626	LibGfx: Remove home-grown JPEG codec in favor of libjpeg-turbo	2024-06-17 17:59:54 +02:00
Daniel Bertalan	397774d422	Everywhere: Remove usages of `template` keyword with no parameter list These were made invalid with P1787, and Clang (19) trunk started warning on them with https://github.com/llvm/llvm-project/pull/80801.	2024-06-16 07:19:56 -04:00
Diego	7560b640f3	AK: Add `AllowSurrogates` to UTF-8 validator The [UTF-8](https://datatracker.ietf.org/doc/html/rfc3629#page-5) standard says to reject strings with upper or lower surrogates. However, in many standards, ECMAScript included, unpaired surrogates (and therefore UTF-8 surrogates) are allowed in strings. So, this commit extends the UTF-8 validation API with `AllowSurrogates`, which controls whether upper and lower surrogate characters are rejected.	2024-06-09 12:16:32 +02:00
circl	666f7338a0	Meta+AK: Clear out unused debug macro definitions	2024-06-09 10:48:19 +02:00
Timothy Flynn	8362c073f3	Everywhere: Remove LibSQL, SQLServer, and the sql REPL :^) It is now entirely unused and replaced by sqlite3.	2024-06-06 11:27:03 -04:00
Andreas Kling	6321e97b09	AK: Remove various unused things	2024-06-04 09:19:39 +02:00
Andreas Kling	e70d96e4e7	Everywhere: Remove a lot more things we don't need	2024-06-03 10:53:53 +02:00
Tim Ledbetter	1a4fbfe495	Everywhere: Remove references to the kernel	2024-06-03 10:53:53 +02:00
Timothy Flynn	fe3fde2411	AK+LibUnicode: Implement a case-insensitive variant of find_byte_offset The existing String::find_byte_offset is case-sensitive. This variant allows performing searches using Unicode-aware case folding.	2024-06-01 07:37:54 +02:00
Daniel Bertalan	637ccacce5	AK: Enable format string checking in Clang builds Format string checking was disabled in Clang-based builds due to a compiler bug: https://github.com/llvm/llvm-project/issues/51182. Now that the requirement has been raised to Clang 17, that is no longer necessary. This has been tested to work correctly with Apple Clang 15.0.0 (which is the least modern supported compiler), as well as CLion 2024.1's bundled Clangd.	2024-05-29 13:34:15 -06:00
Matthew Olsson	e0d6afbabe	ClangPlugins: Invert the lambda detection escape mechanism Instead of being opt-out with NOESCAPE, it is now opt-in with ESCAPING. Opt-out is ideal, but unfortunately this was extremely noisy when compiling the entire codebase. Escaping functions are rarer than non-escaping ones, so let's just go with that for now. This also allows us to gradually add heuristics for detecting missing ESCAPING annotations and emitting them as errors. It also nicely matches the spelling that Swift uses (@escaping), which is where this idea originally came from.	2024-05-22 21:55:34 -06:00
Matthew Olsson	a5f4c9a632	AK+Userland: Remove NOESCAPE See the next commit for an explanation	2024-05-22 21:55:34 -06:00
Dan Klishch	38b51b791e	AK+Kernel+LibVideo: Include workarounds for missing P0960 only in Xcode With this change, ".make." function family now does error checking earlier, which improves experience while using clangd. Note that the change also makes them instantiate classes a bit more eagerly, so in LibVideo/PlaybackManager, we have to first define SeekingStateHandler and only then make() it. Co-Authored-By: stelar7 <dudedbz@gmail.com>	2024-05-21 14:24:59 +02:00
Tim Ledbetter	d0d81e470e	AK: Fix off by one error in integral `ceil_log2()` Previously, certain values of `ceil_log2(x)` would be 1 smaller than `ceil(log2(x))`.	2024-05-21 09:31:17 +02:00
Dan Klishch	be36dbce7d	AK: Don't put element count next to heap-allocated data in FixedArray This not only makes code easier to follow but also makes it faster.	2024-05-18 18:30:42 +02:00
Lucas CHOLLET	c6e4563489	AK: Export Statistics to the global namespace	2024-05-18 18:30:07 +02:00
Andreas Kling	b2e6843055	LibJS+AK: Fix integer overflow UB on (any Int32 - -2147483648) It wasn't safe to use addition_would_overflow(a, -b) to check if subtraction (a - b) would overflow, since it doesn't cover this case. I don't know why we didn't have subtraction_would_overflow(), so this patch adds it. :^)	2024-05-18 18:11:50 +02:00
Sönke Holz	b6cc95c38e	AK: Add a function for frame pointer-based stack unwinding Instead of duplicating stack unwinding code everywhere, introduce a new AK helper to unwind the stack in a generic way.	2024-05-14 14:02:06 -06:00
ptrcnull	13e44ab035	AK: Add stack size fixup for musl libc Fixes #16681	2024-05-14 13:56:45 -06:00
Andreas Kling	6b2b90d2b0	AK: Remove AK_HAS_CONDITIONALLY_TRIVIAL Code behind this appears to compile nicely with Clang 17 and later.	2024-05-10 15:03:24 +00:00
implicitfield	f923016e0b	AK: Add `reinterpret_as_octal()` This is useful for parsing user-provided integers that should be interpreted as octals.	2024-05-07 16:54:27 -06:00
Abuneri	b5bed37074	AK: Replace FP math in `is_power_of` with a purely integral algorithm The previous naive approach was causing test failures because of rounding issues in some exotic environments. In particular, MSVC via MSBuild.	2024-05-07 16:43:34 -06:00
Andreas Kling	ebe6ec6069	AK: Check for u32 overflow in String::repeated() I don't know why this was checking for size_t overflow, but it was tripping up ASAN malloc() checks by passing a way-too-large size.	2024-05-07 09:15:40 +02:00
Nico Weber	c421a3d7ce	AK: Add missing using statements to Find.h	2024-05-06 17:32:19 +02:00
Sergey Bugaev	0bb37f9c0e	AK: Include <features.h> before checking for platform macros AK/Platform.h did not include any other header file, but expected various macros to be defined. While many of the macros checked here are predefined by the compiler (i.e. GCC's TARGET_OS_CPP_BUILTINS), some may be defined by the system headers instead. In particular, so is __GLIBC__ on glibc-based systems. We have to include some system header for getting __GLIBC__ (or not). It could be possible to include something relatively small and innocuous, like <string.h> for example, but that would still clutter the name space and make other code that would use <string.h> functionality, but forget to include it, build by accident; we wouldn't want that. At the end of the day, the header that actually defines __GLIBC__ (or not) is <features.h>. It's typically included from other glibc headers, and not by user code directly, which makes it unlikely to mask other code accidentally forgetting to include it, since it wouldn't include it in the first place. <features.h> is not defined by POSIX and could be missing on other systems (but it seems to be present at least when using either glibc or musl), so guard its inclusion with __has_include(). Specifically, this fixes AK/StackInfo.cpp not picking up the glibc code path in the cross aarch64-gnu (GNU/Hurd on 64-bit ARM) Lagom build.	2024-05-02 07:46:53 -06:00
Tim Ledbetter	8b01abf9f7	AK: Don't move trivially copyable types in BufferedStream methods	2024-04-30 13:22:56 +02:00
Liav A.	122c82a2a1	AK: Add the SetOnce class The SetOnce class is meant to be used as one-time set boolean flag, which is useful for flags that change only once and then stay immutable forever.	2024-04-26 23:46:23 -06:00
Nico Weber	88d0702763	AK: Make ceil_div() handle one argument being negative correctly `ceil_div(-1, 2)` used to return -1. Now it returns 0, which is the correct ceil(-0.5). (C++'s division semantics have floor semantics for numbers > 0, but ceil semantics for numbers < 0.) This will be important for the JPEG2000 decoder eventually.	2024-04-27 07:09:08 +02:00
Timothy Flynn	fecd08ce64	Everywhere: Remove 'clang-format off' comments that are no longer needed	2024-04-24 16:50:01 -04:00
Timothy Flynn	ec492a1a08	Everywhere: Run clang-format The following command was used to clang-format these files: clang-format-18 -i $(find . \ -not \( -path "./\.*" -prune \) \ -not \( -path "./Base/*" -prune \) \ -not \( -path "./Build/*" -prune \) \ -not \( -path "./Toolchain/*" -prune \) \ -not \( -path "./Ports/*" -prune \) \ -type f -name "*.cpp" -o -name "*.mm" -o -name "*.h") There are a couple of weird cases where clang-format now thinks that a pointer access in an initializer list, e.g. `m_member(ptr->foo)`, is a lambda return statement, and it puts spaces around the `->`.	2024-04-24 16:50:01 -04:00
kleines Filmröllchen	8443d0a74d	AK: Use common ComponentType integer type for float bitfields This allows us to easily use an appropriate integer type when performing float bitfield operations. This change also adds a comment about the technically-incorrect 80-bit extended float mantissa field.	2024-04-23 19:18:09 -06:00
Andrew Kaster	913cffe928	AK: Add workaround for faulty Sanitizer warning on gcc 13+ in Atomic gcc can't seem to figure out that the address of a member variable of AK::Atomic<u32> in AtomicRefCounted cannot be null when fetch_sub-ing. Add a bogus condition to convince the compiler that it can't be null.	2024-04-23 15:37:07 -06:00
dgaston	08aaf4fb07	AK: Add methods to BufferedStream to resize the user supplied buffer These changes allow lines of arbitrary length to be read with BufferedStream. When the user supplied buffer is smaller than the line, it will be resized to fit the line. When the internal buffer in BufferedStream is smaller than the line, it will be read into the user supplied buffer chunk by chunk with the buffer growing accordingly. Other behaviors match the behavior of the existing read_line method.	2024-04-21 11:46:55 +02:00
Jess	ecb7d4b40f	LibJS: Throw RangeError in `StringPrototype::repeat` if OOM currently crashes with an assertion failure in `String::repeated` if malloc can't serve a `count * input_size` sized request, so add `String::repeated_with_error` to propagate the error.	2024-04-20 19:23:46 -04:00
Andrew Kaster	1e749d023a	AK: Add fallible dequeue method to Queue	2024-04-19 16:38:55 -04:00
Dan Klishch	5ed7cd6e32	Everywhere: Use east const in more places These changes are compatible with clang-format 16 and will be mandatory when we eventually bump clang-format version. So, since there are no real downsides, let's commit them now.	2024-04-19 06:31:19 -04:00
implicitfield	1159cd9390	AK+Kernel+LibSanitizer: Implement __ubsan_handle_function_type_mismatch	2024-04-18 13:14:33 -06:00
Space Meyer	fdc0328ce3	Kernel: Exclude individual functions from coverage instrumentation Sticking this to the function source has multiple benefits: - We instrument more code, by not excluding entire files. - NO_SANITIZE_COVERAGE can be used in Header files. - Keeping the info with the source code, means if a function or file is moved around, the NO_SANITIZE_COVERAGE moves with it.	2024-04-15 21:16:22 -06:00
Space Meyer	7d8431dcfc	AK: Toolchain dependent instrumentation __attribute__ GCC sometimes complains about the `no_sanitize("address")` syntax, and clang sometimes complains about the `no_sanitize_address` syntax. Both claim to support both, so that's neat!	2024-04-15 21:16:22 -06:00
Andrew Kaster	8c5e64e686	Ladybird+LibWebView: Add mechanism to get Mach task port for helpers On macOS, it's not trivial to get a Mach task port for your children. This implementation registers the chrome process as a well-known service with launchd based on its pid, and lets each child process send over a reference to its mach_task_self() back to the chrome. We'll need this Mach task port right to get process statistics.	2024-04-09 16:43:27 -06:00
Andrew Kaster	4a9546a7c8	AK: Add platform macro for Mach-based operating system environments	2024-04-09 16:43:27 -06:00
Matthew Olsson	76fa127cbf	LibJSGCVerifier: Detect stack-allocated ref captures in lambdas For example, consider the following code snippet: Vector<Function<void()>> m_callbacks; void add_callback(Function<void()> callback) { m_callbacks.append(move(callback)); } // Somewhere else... void do_something() { int a = 10; add_callback([&a] { dbgln("a is {}", a); }); } // Oops, "a" is now destroyed, but the callback in m_callbacks // has a reference to it! We now statically detect the capture of "a" in the lambda above and flag it as incorrect. Note that capturing the value implicitly with a capture list of `[&]` would also be detected. Of course, many functions that accept Function<...> don't store them anywhere, instead immediately invoking them inside of the function. To avoid a warning in this case, the parameter can be annotated with NOESCAPE to indicate that capturing stack variables is fine: void do_something_now(NOESCAPE Function<...> callback) { callback(...) } Lastly, there are situations where the callback does generally escape, but where the caller knows that it won't escape long enough to cause any issues. For example, consider this fake example from LibWeb: void do_something() { bool is_done = false; HTML::queue_global_task([&] { do_some_work(); is_done = true; }); HTML::main_thread_event_loop().spin_until([&] { return is_done; }); } In this case, we know that the lambda passed to queue_global_task will be executed before the function returns, and will not persist afterwards. To avoid this warning, annotate the type of the capture with IGNORE_USE_IN_ESCAPING_LAMBDA: void do_something() { IGNORE_USE_IN_ESCAPING_LAMBDA bool is_done = false; // ... }	2024-04-09 09:10:44 +02:00
stelar7	3f1019b089	AK: Add XOR method to ByteBuffer	2024-04-08 09:34:49 -06:00
Shannon Booth	8c34842962	AK: Simplify and optimize ASCIICaseInsensitiveFlyStringTraits::equals The member function `equals_ignoring_ascii_case` has a fast path which will return early if it is the same FlyString instance.	2024-04-06 09:17:51 -04:00
Timothy Flynn	c5c5e52c24	AK: Disallow calling ByteString methods that return a view on rvalues This prevents, for example: StringView view = ByteString { "foo" }.view(); This prevents a class of potential UAF.	2024-04-04 11:23:21 +02:00
Timothy Flynn	de80f544d8	AK: Disallow calling String methods that return a view on rvalues This prevents, for example: StringView view = "foo"_string.bytes_as_string_view(); This prevents a class of potential UAF.	2024-04-04 11:23:21 +02:00
Timothy Flynn	b5f22b6e90	AK+Userland: Remove some needlessly explicit conversions to StringView	2024-04-04 11:23:21 +02:00
Timothy Flynn	e0bddbb65e	AK: Add a Stream::write_until_depleted overload for string types All string types currently have to invoke this function as: stream.write_until_depleted("foo"sv.bytes()); This isn't very ergonomic, but more importantly, this overload will allow String/ByteString instances to be written in this manner once e.g. `ByteString::view() &&` is deleted.	2024-04-04 11:23:21 +02:00
Timothy Flynn	c7ea710b55	AK: Return a constant reference from JsonValue::as_string Rather than making a copy of the held string, this returns a reference so that expressions like the following: do_something(json.as_string().view()); are not disallowed once `ByteString::view() &&` is deleted.	2024-04-04 11:23:21 +02:00
Andreas Kling	3881717103	LibJS+AK: Register GC memory as root regions for LeakSanitizer This should fix the gigantic list of false positives dumped by LeakSanitizer on exit.	2024-04-03 12:41:02 +02:00
Hendiadyoin1	877cfe1890	AK: Move generalized internals of UFixedBigIntDivision to BigIntBase We will reuse this in LibCrypto Co-Authored-By: Dan Klishch <danilklishch@gmail.com>	2024-03-25 14:26:29 -06:00
Hendiadyoin1	9045840e33	AK: Use correct wide integer type for qhat check in UFixedBigIntDivision Previously, we were assuming that we were always on a 64-bit platform, which is not 100% correct.	2024-03-25 14:26:29 -06:00
Hendiadyoin1	f95abe8c0e	AK: Make BigIntBase more agnostic to non native word sizes This will allow us to use it in Crypto::UnsignedBigInteger, which always uses 32 bit words	2024-03-25 14:26:29 -06:00
Nico Weber	1ab28276f6	LibGfx: Add the start of a JPEG2000 loader JPEG2000 is the last image format used in PDF filters that we don't have a loader for. Let's change that. This adds all the scaffolding, but no actual implementation yet.	2024-03-25 20:35:00 +01:00
Nico Weber	07750774cf	AK: Allow creating a MaybeOwned<Superclass> from a MaybeOwned<Subclass>	2024-03-25 20:35:00 +01:00
Andreas Kling	2b8a920a7c	AK: Don't blindly use SipHash as default hash function Although it has some interesting properties, SipHash is brutally slow compared to our previous hash function. Since its introduction, it has been highly visible in every profile of doing anything interesting with LibJS or LibWeb. By switching back, we gain a 10x speedup for 32-bit hashes, and "only" a 3x speedup for 64-bit hashes. This comes out to roughly 1.10x faster HashTable insertion, and roughly 2.25x faster HashTable lookup. Hashing is no longer at the top of profiles and everything runs measurably faster. For security-sensitive hash tables with user-controlled inputs, we can opt into SipHash selectively on a case-by-case basis. The vast majority of our uses don't fit that description though.	2024-03-25 12:39:23 +01:00
Timothy Flynn	7e38653492	AK: Reject invalid Base64 encoded string lengths	2024-03-25 08:13:27 +01:00
Timothy Flynn	4ecf4c7617	AK: Compute the exact size of decoded Base64 strings	2024-03-25 08:13:27 +01:00
Timothy Flynn	754ff41b9c	AK: Remove whitespace skipping feature from AK's Base64 decoder This was added in commit `f2663f477f` as a partial implementation of what is now LibWeb's forgiving Base64 decoder. All use cases within LibWeb that require whitespace skipping now use that implementation instead. Removing this feature from AK allows us to know the exact output size of a decoded Base64 string. We can still trim whitespace at the start and end of the input though; for example, this is useful when reading from a file that may have a newline at the end of the file.	2024-03-25 08:13:27 +01:00
Timothy Flynn	690db10463	AK: Convert Base64 template parameters to regular function parameters The generated function name is otherwise very long, which makes stack traces a bit more difficult to sift through.	2024-03-25 08:13:27 +01:00
Timothy Flynn	f292746134	AK: Convert some west-consts to east-const in Base64.cpp Caught by clang-format-17. Note that clang-format-16 is fine with this as well (it leaves the const placement alone), it just doesn't perform the formatting to east-const itself.	2024-03-25 08:13:27 +01:00
Andreas Kling	3bdfca1119	AK: Make FlyString::from_utf8*() avoid allocation if possible If we already have a FlyString instantiated for the given string, look that up and return it instead of making a temporary String just to use as a key into the FlyString table.	2024-03-24 13:28:24 +01:00
Andreas Kling	8d7a1e5654	LibWeb: Skip some redundant UTF-8 validation in CSS tokenizer If we're just adding code points to a StringBuilder, there's no need to revalidate the result.	2024-03-24 13:28:24 +01:00
Andreas Kling	a88799c032	AK: Remove excessive hashing caused by FlyString table Before this change, the global FlyString table looked like this: HashMap<StringView, Detail::StringBase> After this change, we have: HashTable<Detail::StringData const*, FlyStringTableHashTraits> The custom hash traits are used to extract the stored hash from StringData which avoids having to rehash the StringView repeatedly like we did before. This necessitated a handful of smaller changes to make it work.	2024-03-24 13:28:24 +01:00
Andreas Kling	8bfad24708	AK: Move AK::Detail::StringData to its own header file This will allow us to access it from FlyString.cpp	2024-03-24 13:28:24 +01:00
Dan Klishch	45a0ba2167	AK: Introduce AK::enumerate Co-Authored-By: Tim Flynn <trflynn89@pm.me>	2024-03-23 09:02:58 -04:00
Stanisław Wiśniewski	994fe0b89f	AK: Use `else if constexpr` in explode_byte()	2024-03-21 14:35:20 -06:00
Timothy Flynn	81ad6de41b	AK: Avoid creating an intermediate buffer when decoding a Base64 string There's no need to copy the result. We can also avoid increasing the size of the output buffer by 1 for each written byte. This reduces the runtime of `./bin/base64 -d enwik8.base64 >/dev/null` from 0.917s to 0.632s. (enwik8 is a 100MB test file from http://mattmahoney.net/dc/enwik8.zip)	2024-03-21 15:53:46 +01:00
Timothy Flynn	0fd7ad09a0	AK: Avoid StringBuilder when creating a Base64-encoded string We don't really need the features provided by StringBuilder here, since we know the exact size of the output. Avoiding StringBuilder avoids the recurring capacity/size checks both within StringBuilder itself and its internal ByteBuffer. This reduces the runtime of `./bin/base64 enwik8 >/dev/null` from 0.976s to 0.428s. (enwik8 is a 100MB test file from http://mattmahoney.net/dc/enwik8.zip)	2024-03-21 15:53:46 +01:00
Timothy Flynn	5f5b8ee9bb	AK: Do not perform UTF-8 validation on Base64-encoded strings We know we are only appending ASCII characters to the StringBuilder, so do not bother validating the result. This reduces the runtime of `./bin/base64 enwik8 >/dev/null` from 1.192s to 0.976s. (enwik8 is a 100MB test file from http://mattmahoney.net/dc/enwik8.zip)	2024-03-21 15:53:46 +01:00
Andrew Kaster	e9b16970fe	AK: Add base64url encoding and decoding methods This encoding scheme comes from section 5 of RFC 4648, as an alternative to the standard base64 encode/decode methods. The only difference is that the last two characters are replaced with '-' and '_', as '+' and '/' are not safe in URLs or filenames.	2024-03-20 12:18:57 -04:00
Shannon Booth	e800605ad3	AK+LibURL: Move AK::URL into a new URL library This URL library ends up being a relatively fundamental base library of the system, as LibCore depends on LibURL. This change has two main benefits: * Moving AK back more towards being an agnostic library that can be used between the kernel and userspace. URL has never really fit that description - and is not used in the kernel. * URL _should_ depend on LibUnicode, as it needs punycode support. However, it's not really possible to do this inside of AK as it can't depend on any external library. This change brings us a little closer to being able to do that, but unfortunately we aren't there quite yet, as the code generators depend on LibCore.	2024-03-18 14:06:28 -04:00
Andreas Kling	6724f840cd	AK: Early return from empty hash table lookups to avoid hashing When calling get() or find() on an empty HashTable or HashMap, we can avoid hashing the sought-after key.	2024-03-16 14:27:59 +01:00
Timothy Flynn	e4213f5767	AK: Generalize Span::contains_slow to use the Traits infrastructure This allows, for example, checking if a Span<String> contains a value without having to allocate a String.	2024-03-16 08:42:33 +01:00
Timothy Flynn	faf4ba63c2	AK: Don't use east-constexpr in Span methods	2024-03-16 08:42:33 +01:00
Ali Mohammad Pur	d451f84f31	LibCrypto: Add a minimal DER encoder Progress towards #23562.	2024-03-16 01:17:02 -06:00
Andreas Kling	d125a76f85	AK: Make FlyString-to-FlyString comparison inline & trivial This should never boil down to more than a machine word comparison.	2024-03-14 12:42:08 +01:00
Ali Mohammad Pur	8003bde03d	AK+LibRegex+LibWasm: Remove the non-const COWVector::operator[] This was copying the vector behind our backs, let's remove it and make the copying explicit by putting it behind COWVector::mutable_at(). This is a further 64% performance improvement on Wasm validation.	2024-03-12 17:10:47 +01:00
Ali Mohammad Pur	cefe177a56	AK+LibRegex: Move COWVector to AK This is about to gain a new user, so move it to AK.	2024-03-12 17:10:47 +01:00
Timothy Flynn	e3b5e24ce0	AK: Iterate the bytes of a URL query with an unsigned type Otherwise, we percent-encode negative signed chars incorrectly. For example, https://www.strava.com/login contains the following hidden <input> field: <input name="utf8" type="hidden" value="✓" /> On submitting the form, we would percent-encode that field as: utf8=%-1E%-64%-6D Which would cause us to receive an HTTP 500 response. We now properly percent-encode that field as: utf8=%E2%9C%93 And can login to Strava :^)	2024-03-10 15:17:31 +01:00
Nico Weber	58838db445	LibGfx: Add the start of a JBIG2 loader JBIG2 is infamous for two things: 1. It's used in Xerox scanners where it falsifies scanned numbers: https://www.dkriesel.com/en/blog/2013/0802_xerox-workcentres_are_switching_written_numbers_when_scanning 2. It was allegedly used in an iOS zero day, in a very cool way: https://googleprojectzero.blogspot.com/2021/12/a-deep-dive-into-nso-zero-click.html Needless to say, we need support for it in Serenity. (...because it's used in PDF files.) This adds all the scaffolding, but no actual implementation yet. It's enough for `file` to print the mime type of .jb2 files, but `image` can't do anything with the files yet.	2024-03-09 16:01:22 +01:00
Timothy Flynn	82ea53cf10	AK: Add a StringView method to count the number of lines in a string We already have a helper to split a StringView by line while considering "\n", "\r", and "\r\n". Add an analogous method to just count the number of lines in the same manner.	2024-03-08 14:43:33 -05:00
Timothy Flynn	07a27b2ec0	AK: Replace the boolean parameter of StringView::lines with a named enum	2024-03-08 14:43:33 -05:00
Matthew Olsson	a511f1ef85	AK: Add HashMap::ensure_capacity	2024-03-06 07:45:56 +01:00
Filiph Siitam Sandström	fd694e8672	AK+Lagom: Make it possible to build for iOS This commit makes it possible to build AK and most of Lagom for iOS, based on the work for the Ladybird build demoed on discord: https://discord.com/channels/830522505605283862/830525031720943627/1211987732646068314	2024-03-03 13:13:42 -07:00
Hendiadyoin1	79fd8eb28d	AK/HashMap: Use structured bindings when iterating over itself	2024-03-01 14:05:53 -07:00
Nico Weber	f8b8d1b3be	AK: Add is_ascii_uppercase_hex_digit()	2024-03-01 14:17:42 +01:00
Timothy Flynn	d878975f95	AK+LibJS: Remove OFFSET_OF and its users With the LibJS JIT removed, let's not expose pointers to internal members.	2024-02-29 09:00:00 +01:00
Andrew Kaster	21ac431fac	AK: Allow reading from EOF buffered streams better in read_line() If the BufferedStream is able to fill its entire circular buffer in populate_read_buffer() and is later asked to read a line or read until a delimiter, it could erroneously return EMSGSIZE if the caller's buffer was smaller than the internal buffer. In this case, all we really care about is whether the caller's buffer is big enough for however much data we're going to copy into it. Which needs to take into account the candidate.	2024-02-26 13:16:27 -07:00
Dan Klishch	ba24e86fdd	AK: Introduce IntrusiveBinaryHeap and reimplement BinaryHeap using it The main difference between them is that IntrusiveBinaryHeap can optionally maintain an index inside every stored node that allows arbitrary nodes to be deleted.	2024-02-25 17:24:36 -07:00
Hendiadyoin1	38cb5444d9	AK: Make StringView::for_each_split_view() aware of IterationDecision	2024-02-24 16:43:44 -07:00
Dan Klishch	8ac0e3f0e5	AK+LibJS: Remove null state from DeprecatedFlyString :^)	2024-02-24 15:06:52 -07:00
Dan Klishch	061f902f95	AK+Userland: Introduce ByteString::create_and_overwrite And replace two users of raw StringImpl with it.	2024-02-24 15:06:52 -07:00
Ali Mohammad Pur	bc301b6f40	AK+LibXML+JSSpecCompiler: Move LineTrackingLexer to AK This is a simple extension of GenericLexer, and is used in more than just LibXML, so let's move it into AK. The move also resolves a FIXME, which is removed in this commit.	2024-02-16 15:26:43 +01:00
Lucas CHOLLET	cbfea68ed8	AK: Add `BigEndianInputBitStream::bits_until_next_byte_boundary()`	2024-02-12 14:08:56 +01:00
Nico Weber	d84b69ace9	AK: Add to_array() This is useful if you want an array with an explicit type but still want its size to be inferred.	2024-02-11 18:53:00 +01:00
Nico Weber	10216e1743	AK: Remove a stray `static` No behavior change.	2024-02-11 18:53:00 +01:00
Nico Weber	4409b33145	AK: Make IndexSequence use size_t This makes it possible to use MakeIndexSequence in functions like: template<typename T, size_t N> constexpr auto foo(T (&a)[N]) This means AK/StdLibExtraDetails.h must now include AK/Types.h for size_t, which means AK/Types.h can no longer include AK/StdLibExtras.h (which arguably it shouldn't do anyways), which requires rejiggering some things. (IMHO Types.h shouldn't use AK::Details metaprogramming at all. FlatPtr doesn't necessarily have to use Conditional<> and ssize_t could maybe be in its own header or something. But since it's tangential to this PR, going with the tried and true "lift things that cause the cycle up to the top" approach.)	2024-02-11 18:53:00 +01:00
Tim Ledbetter	4a7236cabf	Everywhere: Prefer `_string` when constructing strings from literals	2024-02-08 11:01:10 -05:00
Dan Klishch	88af15d513	AK: Store JsonValue's value in AK::Variant	2024-02-08 08:04:05 -07:00
Andrew Kaster	bc9c710904	LibWeb: Hide WebDriver::match_route debug behind its own flag When enabling WEBDRIVER_DEBUG globally, this function's debug spam overpowers the rest of the useful logs.	2024-02-08 15:53:46 +01:00
Dan Klishch	677bcea771	ntpquery: Use AK::convert_between_host_and_network_endian Instead of polluting global namespace with definitions from libkern/OSByteOrder.h and machine/endian.h on MacOS, just use AK functions for conversions.	2024-02-06 04:37:47 -07:00
vincent-rg	a9df60ff1c	AK: Update OptionParser::m_arg_index by subtracting skipped args On argument swapping to put positional ones toward the end, m_arg_index was pointing at "last arg index" + "skipped args" + "consumed args" and thus was pointing ahead of the skipped ones. m_arg_index now points after the current parsed option arguments.	2024-02-06 00:08:30 +01:00
Dan Klishch	3e43d15440	Everywhere: Prefer `VERIFY` over `assert()`	2024-02-05 07:03:53 -05:00
Nico Weber	41f57a5477	AK: Remove the SIMD version of rsqrt() too, for good measure No strong reason to remove this one, other than that it's also unused.	2024-01-30 10:02:33 +01:00
Nico Weber	a1f70b39fa	AK: Remove rsqrt() At least on arm64, this isn't very precise: https://github.com/SerenityOS/serenity/issues/22739#issuecomment-1912909835 It is also now unused.	2024-01-30 10:02:33 +01:00
Shannon Booth	c6319d68c3	AK: Introduce EquivalentFunctionType This allows you to get the type from a function from some given callable 'T'. Co-Authored-By: Ali Mohammad Pur <mpfard@serenityos.org>	2024-01-27 21:40:25 -05:00
Ali Mohammad Pur	0e61d039c9	AK: Use IsSame<FlatPtr, T> instead of __LP64__ to guess FlatPtr's type Instead of playing the guessing game, simply use whatever type FlatPtr itself resolves to.	2024-01-28 04:30:33 +03:30
Sam Atkins	388856dc7e	AK+Userland: Return String from human_readable_size() functions	2024-01-25 09:07:32 +01:00
Sam Atkins	7e8cfb60eb	AK+Userland: Return String from human_readable_[digital_]time()	2024-01-25 09:07:32 +01:00
Dan Klishch	870a947040	AK: Remove StringInternals.h Since we do not expose memory layout anymore in StringBase, there is no need to keep StringData public.	2024-01-21 16:16:15 -07:00
Dan Klishch	611adf1591	AK: Make the state of StringBase private Now it actually only exposes methods to allocate uninitialized storage and to create substring with a shared superstring. All the details of the memory layout are fully encapsulated.	2024-01-21 16:16:15 -07:00
Dan Klishch	fa52f68142	AK: Store data in FlyString as StringBase Unfortunately, it is not clear to me how to split this commit into several atomic ones.	2024-01-21 16:16:15 -07:00
Dan Klishch	e7700e16ee	AK: Forward substring creation with shared superstring to StringBase	2024-01-21 16:16:15 -07:00
Dan Klishch	5d6cd65e29	AK: Simplify String::repeated by leveraging StringBase helpers	2024-01-21 16:16:15 -07:00
Dan Klishch	7dbe357e9f	AK: Simplify String::from_stream by leveraging StringBase helpers	2024-01-21 16:16:15 -07:00
Dan Klishch	7506736869	AK: Stop using ShortString in String::from_code_point Refactor it to use StringBase::replace_with_new_short_string instead.	2024-01-21 16:16:15 -07:00
Dan Klishch	dcd1fda9c8	AK: Introduce StringBase::replace_with_new_{short_,}string	2024-01-21 16:16:15 -07:00
Dan Klishch	d6290c4684	AK: Move String::hash() and String::String() to StringBase	2024-01-21 16:16:15 -07:00
Dan Klishch	1b09a1851e	AK: Move String::~String() and String::destroy_string() to StringBase	2024-01-21 16:16:15 -07:00
Dan Klishch	54d149bc25	AK: Move String::bytes() and String::operator==(String) to StringBase The idea is to eventually get rid of protected state in StringBase. To do this, we first need to remove all references to m_data and m_short_string from String.	2024-01-21 16:16:15 -07:00
Dan Klishch	4364a28d3d	AK: Move data fields from AK::String to a newly created AK::StringBase This starts separating memory management of string data and string utilities like `String::formatted`. This would also allow to reuse the same storage in `DeprecatedString` in the future.	2024-01-21 16:16:15 -07:00
Dan Klishch	6e2f627cb3	AK: Move StringData from String.cpp to a newly created StringInternals.h This is done to allow using it in files other than AK/String.cpp.	2024-01-21 16:16:15 -07:00
Dan Klishch	855ea192be	AK: Add AK_MAKE_DEFAULT_COPYABLE	2024-01-21 16:16:15 -07:00
Dan Klishch	7f8d69ee2f	AK: Remove explicit String::operator!= in favor of defaulted one	2024-01-21 16:16:15 -07:00
Dan Klishch	b5f1a48a7c	AK+Everywhere: Remove JsonValue APIs with implicit default values	2024-01-21 15:47:53 -07:00
Dan Klishch	c49819cced	AK+GMLCompiler+LibWeb: Remove JsonValue::is_double This concludes a series of patches which remove the ability to observe which arithmetic type is used to store number in JsonValue.	2024-01-21 15:47:53 -07:00
Dan Klishch	faef802229	AK+GMLCompiler: Remove JsonValue::as_double() Replace its single (non-test) usage with newly created as_number(), which does not leak information about internal integer storage type.	2024-01-21 15:47:53 -07:00
Dan Klishch	5230d2af91	AK+WebContent: Remove JsonValue::as_{i,u}{32,64}()	2024-01-21 15:47:53 -07:00
Ali Mohammad Pur	4f6c9f410c	AK+LibCore: Add BufferedSocket::can_read_up_to_delimiter() This method (unlike can_read_line) ensures that the delimiter is present in the buffer, and doesn't return true after eof when the delimiter is absent.	2024-01-21 21:13:58 +01:00
Ali Mohammad Pur	4d1d88aa16	AK: Make the :hex-dump format specifier print all characters Previously the final line would be skipped if it was not a multiple of |width|, this makes the character view show up for that line.	2024-01-21 21:13:58 +01:00
Tim Ledbetter	65827826fe	AK: Add `CharacterTypes::is_ascii_base36_digit()` This can be used to validate the string passed to `parse_ascii_base36_digit()`.	2024-01-13 19:01:35 -07:00
Dan Klishch	ccd701809f	Everywhere: Add deprecated_ prefix to `JsonValue::to_byte_string` `JsonValue::to_byte_string` has peculiar type-erasure semantics which is not usually intended. Unfortunately, it also has a very stereotypical name which does not warn about unexpected behavior. So let's prefix it with `deprecated_` to make new code use `as_string` if it just wants to get string value or `serialized<StringBuilder>` if it needs to do proper serialization.	2024-01-12 17:41:34 -07:00
kleines Filmröllchen	eada4f2ee8	AK: Remove ByteString from GenericLexer A bunch of users used consume_specific with a constant ByteString literal, which can be replaced by an allocation-free StringView literal. The generic consume_while overload gains a requires clause so that consume_specific("abc") causes a more understandable and actionable error.	2024-01-12 17:03:53 -07:00
Martin Janiczek	5a8781393a	AK: Cover TestComplex with more tests Related: - video detailing the process of writing these tests: https://www.youtube.com/watch?v=enxglLlALvI - PR fixing bugs the above effort found: https://github.com/SerenityOS/serenity/pull/22025	2024-01-12 16:42:51 -07:00
Martin Janiczek	d52ffcd830	LibTest: Add more numeric generators Rename unsigned_int generator to number_u32. Add generators: - number_u64 - number_f64 - percentage	2024-01-12 16:42:51 -07:00
Andrew Kaster	09ce32039f	AK: Use cast to const void pointer in to_readonly_span helper This lets developers actually hex-dump print `Span<T const>` using the helper as intended.	2024-01-06 10:13:14 +01:00
Timothy Flynn	cae184d7cf	AK: Improve performance of StringUtils::find_last The current algorithm is O(N^2) because we forward-search an ever-increasing substring of the haystack. This implementation reduces the search time of a 500,000-length string (where the desired needle is at index 0) from 72 seconds to 2-3 milliseconds.	2024-01-04 11:28:03 -05:00
Timothy Flynn	9cab4958e6	AK: Convert a couple String-related declarations to east-const Caught by clang-format-17. Note that clang-format-16 is fine with this as well (it leaves the const placement alone), it just doesn't perform the formatting to east-const itself.	2024-01-04 11:28:03 -05:00
Timothy Flynn	1b4a23095c	AK: Add a Utf16View::starts_with method Based heavily on Utf8View::starts_with.	2024-01-04 12:43:10 +01:00
Timothy Flynn	c46ba7e68d	AK: Allow constructing a UTF-16 view from a UTF-16 string literal UTF-16 string literals are a language-level feature. It is convenient to be able to construct a Utf16View from these strings.	2024-01-04 12:43:10 +01:00
Aliaksandr Kalenik	e394971209	AK+LibWeb: Use segmented vector to store commands in RecordingPainter Using a vector to represent a list of painting commands results in many reallocations, especially on pages with a lot of content. This change addresses it by introducing a SegmentedVector, which allows fast appending by representing a list as a sequence of fixed-size vectors. Currently, this new data structure supports only the operations used in RecordingPainter, which are appending and iterating.	2023-12-30 23:02:46 +01:00
Andreas Kling	7ad7ae7000	AK: Check URL parser input for invalid (tabs or newlines) in 1 pass Combine 2 passes into 1 by iterating over the input once and checking for both '\t' and '\n'.	2023-12-30 13:49:50 +01:00
Andreas Kling	a19d8a4a37	AK: Add ASCII fast path to Utf8CodePointIterator Much of the UTF-8 data that we'll iterate over will be ASCII only, and we can get a significant speed-up by simply having a fast path when the iterator points at a byte that is obviously an ASCII character (<= 0x7F).	2023-12-30 13:49:50 +01:00
Andreas Kling	75cecd19a5	AK: Skip UTF-8 validation inside URL parser Since we're already building up a percent-encoded ASCII-only string in the internal parser buffer, there's no need to do a second UTF-8 validation pass before assigning each part of the parsed URL. This makes URL parsing significantly faster.	2023-12-30 13:49:50 +01:00
Andreas Kling	f045a877b4	AK: Implement StringBuilder::append_code_point() more efficiently Instead of doing a wrappy MUST(try_append_code_point()), we now inline the UTF-8 encoding logic. This allows us to grow the buffer by the right increment up front, and also removes a bunch of ErrorOr ceremony that we don't care about.	2023-12-30 13:49:50 +01:00
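
The last commit above (f045a877b4) describes inlining the UTF-8 encoding logic so the buffer can grow by the right amount up front. A minimal sketch of that idea follows, assuming a plain std::string as the buffer. The function name and signature here are hypothetical and are not the AK::StringBuilder API; surrogate handling is deliberately left out.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper (not the AK API): appends one code point to `out`
// as UTF-8. Returns false if the value is above U+10FFFF.
// Note: this sketch does not reject surrogate code points.
bool append_code_point(std::string& out, std::uint32_t code_point)
{
    // Work out the encoded length first so the buffer grows once.
    std::size_t length = 0;
    if (code_point <= 0x7f)
        length = 1;
    else if (code_point <= 0x7ff)
        length = 2;
    else if (code_point <= 0xffff)
        length = 3;
    else if (code_point <= 0x10ffff)
        length = 4;
    else
        return false;

    out.reserve(out.size() + length);

    switch (length) {
    case 1:
        out.push_back(static_cast<char>(code_point));
        break;
    case 2:
        out.push_back(static_cast<char>(0xc0 | (code_point >> 6)));
        out.push_back(static_cast<char>(0x80 | (code_point & 0x3f)));
        break;
    case 3:
        out.push_back(static_cast<char>(0xe0 | (code_point >> 12)));
        out.push_back(static_cast<char>(0x80 | ((code_point >> 6) & 0x3f)));
        out.push_back(static_cast<char>(0x80 | (code_point & 0x3f)));
        break;
    default:
        out.push_back(static_cast<char>(0xf0 | (code_point >> 18)));
        out.push_back(static_cast<char>(0x80 | ((code_point >> 12) & 0x3f)));
        out.push_back(static_cast<char>(0x80 | ((code_point >> 6) & 0x3f)));
        out.push_back(static_cast<char>(0x80 | (code_point & 0x3f)));
        break;
    }
    return true;
}
```

Computing the length before writing is what lets the caller reserve exactly the bytes needed, rather than going through a fallible per-byte append path.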

3635 commits