beenull/ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2024-11-21 15:10:19 +00:00

Author	SHA1	Message	Date
stasoid	9ebed7d8d5	AK: Add StringBuilder::append_repeated(StringView, size_t) By analogy with append_repeated(char, size_t)	2024-11-09 12:42:27 -07:00
Andreas Kling	cc4b3cbacc	Meta: Update my e-mail address everywhere Some checks are pending CI / Lagom (false, FUZZ, ubuntu-24.04, Linux, Clang) (push) Waiting to run Details CI / Lagom (false, NO_FUZZ, macos-14, macOS, Clang) (push) Waiting to run Details CI / Lagom (false, NO_FUZZ, ubuntu-24.04, Linux, GNU) (push) Waiting to run Details CI / Lagom (true, NO_FUZZ, ubuntu-24.04, Linux, Clang) (push) Waiting to run Details Package the js repl as a binary artifact / build-and-package (macos-14, macOS, macOS-universal2) (push) Waiting to run Details Package the js repl as a binary artifact / build-and-package (ubuntu-24.04, Linux, Linux-x86_64) (push) Waiting to run Details Run test262 and test-wasm / run_and_update_results (push) Waiting to run Details Lint Code / lint (push) Waiting to run Details Push notes / build (push) Waiting to run Details	2024-10-04 13:19:50 +02:00
Timothy Flynn	29879a69a4	AK: Construct Strings from StringBuilder without re-allocating the data Currently, invoking StringBuilder::to_string will re-allocate the string data to construct the String. This is wasteful both in terms of memory and speed. The goal here is to simply hand the string buffer over to String, and let String take ownership of that buffer. To do this, StringBuilder must have the same memory layout as Detail::StringData. This layout is just the members of the StringData class followed by the string itself. So when a StringBuilder is created, we reserve sizeof(StringData) bytes at the front of the buffer. StringData can then construct itself into the buffer with placement new. Things to note: * StringData must now be aware of the actual capacity of its buffer, as that can be larger than the string size. * We must take care not to pass ownership of inlined string buffers, as these live on the stack.	2024-07-20 06:45:49 +02:00
Timothy Flynn	af220af8bf	AK: Remove StringBuilder's UseInlineCapacityOnly feature This feature is unused in Ladybird and will complicate an upcoming patch to hand-off StringBuilder's memory to String.	2024-07-20 06:45:49 +02:00
Tim Ledbetter	5ca2f4dfd7	Everywhere: Remove all KERNEL `#define`s	2024-06-18 09:36:25 +02:00
Andreas Kling	8d7a1e5654	LibWeb: Skip some redundant UTF-8 validation in CSS tokenizer If we're just adding code points to a StringBuilder, there's no need to revalidate the result.	2024-03-24 13:28:24 +01:00
Andreas Kling	3c039903fb	LibTextCodec+AK: Don't validate UTF-8 strings twice UTF8Decoder was already converting invalid data into replacement characters while converting, so we know for sure we have valid UTF-8 by the time conversion is finished. This patch adds a new StringBuilder::to_string_without_validation() and uses it to make UTF8Decoder avoid half the work it was doing.	2023-12-30 13:49:50 +01:00
Ali Mohammad Pur	5e1499d104	Everywhere: Rename {Deprecated => Byte}String This commit un-deprecates DeprecatedString, and repurposes it as a byte string. As the null state has already been removed, there are no other particularly hairy blockers in repurposing this type as a byte string (what it _really_ is). This commit is auto-generated: $ xs=$(ack -l \bDeprecatedString\b\\|deprecated_string AK Userland \ Meta Ports Ladybird Tests Kernel) $ perl -pie 's/\bDeprecatedString\b/ByteString/g; s/deprecated_string/byte_string/g' $xs $ clang-format --style=file -i \ $(git diff --name-only \| grep \.cpp\\|\.h) $ gn format $(git ls-files '.gn' '.gni')	2023-12-17 18:25:10 +03:30
Liav A	b2fd51f561	AK: Implement string formatting for FixedStringBuffers To ensure this happens without duplicating code, we allow forcing a StringBuilder object to only use the inline buffer, so the code in the AK/Format.cpp file doesn't need to deal with different underlying storage types (expandable or inline-fixed) at all.	2023-08-12 11:48:48 -06:00
Linus Groh	e76394d96c	AK: Remove infallible version of StringBuilder::to_byte_buffer Also drop the try_ prefix from the fallible function, as it is no longer needed to distinguish the two.	2023-03-09 15:51:00 +00:00
Karol Baraniecki	b4b283670d	AK: Introduce a fallible version of StringBuilder::to_byte_buffer Name it StringBuilder::try_to_byte_buffer accordingly :^)	2023-03-09 12:59:57 +00:00
Sam Atkins	1453ac79e7	AK: Add StringBuilder::to_fly_string()	2023-02-15 12:48:26 -05:00
Timothy Flynn	79aaa2fe0f	AK: Allow the kernel to have access to StringBuilder::to_string This is mostly to prevent String.h from acquiring ifdef-soup. In any case, it's fine for the kernel to see this symbol as it is fallible.	2023-01-28 00:13:46 +00:00
Linus Groh	6e7459322d	AK: Remove StringBuilder::build() in favor of to_deprecated_string() Having an alias function that only wraps another one is silly, and keeping the more obvious name should flush out more uses of deprecated strings. No behavior change.	2023-01-27 20:38:49 +00:00
MacDue	2366265c53	AK: Add StringBuilder::try_join() This is a failable version of StringBuilder::join().	2023-01-14 12:37:00 +01:00
MacDue	9a120d7243	AK: Add support for "debug only" formatters These are formatters that can only be used with debug print functions, such as dbgln(). Currently this is limited to Formatter<ErrorOr<T>>. With this you can still debug log ErrorOr values (good for debugging), but trying to use them in any String::formatted() call will fail (which prevents .to_string() errors with the new failable strings being ignored). You make a formatter debug only by adding a constexpr method like: static constexpr bool is_debug_only() { return true; }	2023-01-13 21:09:26 +00:00
Ali Mohammad Pur	543890c5c9	AK: Add a fallible StringBuilder::create() factory function This is nice, and is also used by the Jakt runtime.	2022-12-11 20:44:54 +03:30
Moustafa Raafat	b8f1e1bed2	Everywhere: Remove unnecessary AK and Detail namespace scoping	2022-12-09 11:25:30 +00:00
Andreas Kling	a3e82eaad3	AK: Introduce the new String, replacement for DeprecatedString DeprecatedString (formerly String) has been with us since the start, and it has served us well. However, it has a number of shortcomings that I'd like to address. Some of these issues are hard if not impossible to solve incrementally inside of DeprecatedString, so instead of doing that, let's build a new String class and then incrementally move over to it instead. Problems in DeprecatedString: - It assumes string allocation never fails. This makes it impossible to use in allocation-sensitive contexts, and is the reason we had to ban DeprecatedString from the kernel entirely. - The awkward null state. DeprecatedString can be null. It's different from the empty state, although null strings are considered empty. All code is immediately nicer when using Optional<DeprecatedString> but DeprecatedString came before Optional, which is how we ended up like this. - The encoding of the underlying data is ambiguous. For the most part, we use it as if it's always UTF-8, but there have been cases where we pass around strings in other encodings (e.g ISO8859-1) - operator[] and length() are used to iterate over DeprecatedString one byte at a time. This is done all over the codebase, and will not give the right results unless the string is all ASCII. How we solve these issues in the new String: - Functions that may allocate now return ErrorOr<String> so that ENOMEM errors can be passed to the caller. - String has no null state. Use Optional<String> when needed. - String is always UTF-8. This is validated when constructing a String. We may need to add a bypass for this in the future, for cases where you have a known-good string, but for now: validate all the things! - There is no operator[] or length(). You can get the underlying data with bytes(), but for iterating over code points, you should be using an UTF-8 iterator. Furthermore, it has two nifty new features: - String implements a small string optimization (SSO) for strings that can fit entirely within a pointer. This means up to 3 bytes on 32-bit platforms, and 7 bytes on 64-bit platforms. Such small strings will not be heap-allocated. - String can create substrings without making a deep copy of the substring. Instead, the superstring gets +1 refcount from the substring, and it acts like a view into the superstring. To make substrings like this, use the substring_with_shared_superstring() API. One caveat: - String does not guarantee that the underlying data is null-terminated like DeprecatedString does today. While this was nifty in a handful of places where we were calling C functions, it did stand in the way of shared-superstring substrings.	2022-12-06 15:21:26 +01:00
Linus Groh	57dc179b1f	Everywhere: Rename to_{string => deprecated_string}() where applicable This will make it easier to support both string types at the same time while we convert code, and tracking down remaining uses. One big exception is Value::to_string() in LibJS, where the name is dictated by the ToString AO.	2022-12-06 08:54:33 +01:00
Linus Groh	6e19ab2bbc	AK+Everywhere: Rename String to DeprecatedString We have a new, improved string type coming up in AK (OOM aware, no null state), and while it's going to use UTF-8, the name UTF8String is a mouthful - so let's free up the String name by renaming the existing class. Making the old one have an annoying name will hopefully also help with quick adoption :^)	2022-12-06 08:54:33 +01:00
Andreas Kling	ae3ffdd521	AK: Make it possible to not `using` AK classes into the global namespace This patch adds the `USING_AK_GLOBALLY` macro which is enabled by default, but can be overridden by build flags. This is a step towards integrating Jakt and AK types.	2022-11-26 15:51:34 +01:00
Lucas CHOLLET	62b8ccaffc	StringBuilder: Add try_append_repeated() and append_repeated() This two methods add the character as many times as specified by the second parameter.	2022-09-15 14:08:21 +01:00
Idan Horowitz	9da8c78133	AK: Add a try variant of StringBuilder::append_escaped_for_json This will allow us to make a fallible version of the JSON serializers.	2022-02-27 20:37:57 +01:00
Linus Groh	b253bca807	AK: Add optional format string parameter to String{,Builder}::join() Allow specifying a custom format string that's being used for each item instead of hardcoding "{}".	2022-02-23 21:53:30 +00:00
Idan Horowitz	8f093e91e0	AK: Exclude StringBuilder String APIs from the Kernel These APIs are only used by userland, and String is OOM-infallible, so let's just ifdef it out of the Kernel.	2022-02-16 22:21:37 +01:00
Andreas Kling	8a51f64503	AK: Increase StringBuilder's inline buffer size from 128 to 256 bytes	2021-12-26 01:42:58 +01:00
Andreas Kling	216e21a1fa	AK: Convert AK::Format formatting helpers to returning ErrorOr<void> This isn't a complete conversion to ErrorOr<void>, but a good chunk. The end goal here is to propagate buffer allocation failures to the caller, and allow the use of TRY() with formatting functions.	2021-11-17 00:21:13 +01:00
Andreas Kling	008355c222	AK: Add failable try_* functions to StringBuilder These will allow us to start using TRY() with StringBuilder operations.	2021-11-17 00:21:13 +01:00
Andreas Kling	8b1108e485	Everywhere: Pass AK::StringView by value	2021-11-11 01:27:46 +01:00
Andreas Kling	a15ed8743d	AK: Make ByteBuffer::try_* functions return ErrorOr<void> Same as Vector, ByteBuffer now also signals allocation failure by returning an ENOMEM Error instead of a bool, allowing us to use the TRY() and MUST() patterns.	2021-11-10 21:58:58 +01:00
Ali Mohammad Pur	3a9f00c59b	Everywhere: Use OOM-safe ByteBuffer APIs where possible If we can easily communicate failure, let's avoid asserting and report failure instead.	2021-09-06 01:53:26 +02:00
Brian Gianforcaro	f0b3aa0331	Everywhere: Pass AK::Format TypeErasedFormatParams by reference This silences a overeager warning in sonar cloud, warning that slicing could occur with `VariadicFormatParams` which derives from `TypeErasedFormatParams`. Reference: https://sonarcloud.io/project/issues?id=SerenityOS_serenity&issues=AXuVPBO_k92xXUF3qWsm&open=AXuVPBO_k92xXUF3qWsm	2021-08-30 15:50:00 +04:30
Timothy Flynn	c16aca7abf	AK+Kernel: Add StringBuilder::append overload for UTF-16 views Currently, to append a UTF-16 view to a StringBuilder, callers must first convert the view to UTF-8 and then append the copy. Add a UTF-16 overload so callers do not need to hold an entire copy in memory.	2021-08-10 23:07:50 +02:00
Timothy Flynn	5978caf96b	AK: Convert StringBuilder to use east-const	2021-08-10 23:07:50 +02:00
Ali Mohammad Pur	3829bf115c	AK: Make StringBuilder::join() use appendff() instead of append() `append()` is almost never going to select the overload that is desired. e.g. it will append chars when you pass it a Vector<size_t>, which is definitely not the right overload :)	2021-08-06 01:14:03 +02:00
Gunnar Beutner	de0aa44bb6	AK: Remove the m_length member for StringBuilder Instead we can just use ByteBuffer::size() which already keeps track of the buffer's size.	2021-05-31 14:49:00 +04:30
Max Wipfli	f51b0729f5	AK: Implement StringBuilder::append_as_lowercase(char ch) This patch adds a convenience method to AK::StringBuilder which converts ASCII uppercase characters to lowercase before appending them.	2021-05-18 21:02:07 +02:00
Gunnar Beutner	fcaf98361f	AK: Turn ByteBuffer into a value type Previously ByteBuffer would internally hold a RefPtr to the byte buffer and would behave like a reference type, i.e. copying a ByteBuffer would not create a duplicate byte buffer, but rather two objects which refer to the same internal buffer. This also changes ByteBuffer so that it has some internal capacity much like the Vector<T> type. Unlike Vector<T> however a byte buffer's data may be uninitialized. With this commit ByteBuffer makes use of the kmalloc_good_size() API to pick an optimal allocation size for its internal buffer.	2021-05-16 17:49:42 +02:00
Andreas Kling	834b6508d7	AK: Remove StringBuilder::appendf() All users have been converted to using AK::Format via appendff().	2021-05-07 21:12:09 +02:00
Brian Gianforcaro	1682f0b760	Everything: Move to SPDX license identifiers in all files. SPDX License Identifiers are a more compact / standardized way of representing file license information. See: https://spdx.dev/resources/use/#identifiers This was done with the `ambr` search and replace tool. ambr --no-parent-ignore --key-from-file --rep-from-file key.txt rep.txt *	2021-04-22 11:22:27 +02:00
Brian Gianforcaro	7db74a6b3e	AK: Annotate StringBuilder functions as [[nodiscard]]	2021-04-11 12:50:33 +02:00
AnotherTest	7c2754c3a6	AK+Kernel+Userland: Enable some more compiletime format string checks This enables format string checks for three more functions: - String::formatted() - Builder::appendff() - KBufferBuilder::appendff()	2021-02-23 13:59:33 +01:00
Lenny Maiorani	e6f907a155	AK: Simplify constructors and conversions from nullptr_t Problem: - Many constructors are defined as `{}` rather than using the ` = default` compiler-provided constructor. - Some types provide an implicit conversion operator from `nullptr_t` instead of requiring the caller to default construct. This violates the C++ Core Guidelines suggestion to declare single-argument constructors explicit (https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#c46-by-default-declare-single-argument-constructors-explicit). Solution: - Change default constructors to use the compiler-provided default constructor. - Remove implicit conversion operators from `nullptr_t` and change usage to enforce type consistency without conversion.	2021-01-12 09:11:45 +01:00
Sahan Fernando	ccf4368ca5	AK: Enable format string warnings for AK printf wrappers	2021-01-11 21:06:32 +01:00
Andreas Kling	54ade31d84	AK: Add some inline capacity to StringBuilder This patch adds a 128-byte inline buffer that we use before switching to using a dynamically growing ByteBuffer. This allows us to avoid heap allocations in many cases, and totally incidentally also speeds up @nico's favorite test, "disasm /bin/id" more than 2x. :^)	2020-11-24 22:06:51 +01:00
Andreas Kling	5e164052f6	AK+Kernel: Escape JSON keys & values Grab the escaping logic from JSON string value serialization and use it for serializing all keys and values. Fixes #3917.	2020-11-02 12:56:36 +01:00
asynts	6351a56d27	AK+Format: Do some housekeeping in the format implementation.	2020-10-02 20:48:19 +02:00
asynts	b7a4c4482f	AK: Resolve format related circular dependencies properly. With this commit, <AK/Format.h> has a more supportive role and isn't used directly. Essentially, there now is a public 'vformat' function ('v' for vector) which takes already type erased parameters. The name is choosen to indicate that this function behaves similar to C-style functions taking a va_list equivalent. The interface for frontend users are now 'String::formatted' and 'StringBuilder::appendff'.	2020-09-23 21:45:28 +02:00
asynts	e5497a326a	AK: Add StringBuilder::appendff using the new format. StringBuilder::appendf was already used, thus this name. If we some day replace all usages of printf, we could rename this method.	2020-09-22 15:06:40 +02:00

1 2

77 commits