Similar to POSIX read, the basic read and write functions of AK::Stream
do not have a lower limit on how much data they read or write (apart
from "none at all").
Rename the functions to "read some [data]" and "write some [data]" (with
"data" being omitted, since everything here is reading and writing data)
to make them sufficiently distinct from the functions that ensure
the entire buffer is used (which should be the go-to functions for
most usages).
No functional changes, just a lot of new FIXMEs.
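For illustration, a minimal caller-side sketch of why the
whole-buffer variants should usually be preferred, assuming a
read_some() that returns the bytes it managed to read:
#include <AK/Error.h>
#include <AK/Span.h>
#include <AK/Stream.h>
#include <AK/Try.h>
// Filling a buffer with the "read some" primitive requires a loop,
// because, like POSIX read(2), it may return fewer bytes than asked
// for.
static ErrorOr<void> fill_buffer(AK::Stream& stream, Bytes buffer)
{
    size_t nread = 0;
    while (nread < buffer.size()) {
        auto chunk = TRY(stream.read_some(buffer.slice(nread)));
        if (chunk.is_empty())
            return Error::from_string_literal("Unexpected end of stream");
        nread += chunk.size();
    }
    return {};
}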
This also removes DirIterator::error_string(), since the same strerror()
string will be included when you print the Error itself, except in
`ls`, which is still using fprintf() for now.
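A hedged before/after sketch (the exact call sites and accessor
names vary per program):
// Before: a separate helper that produced the strerror() text.
warnln("DirIterator: {}", dir_iterator.error_string());
// After: formatting the Error directly already includes the
// strerror() text for errno-backed errors.
warnln("DirIterator: {}", dir_iterator.error());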
This sorts the array of generated emoji data by code point (first by
code point length, then by code point value). This lets us use a binary
search to find emoji data, rather than the current linear search.
In a profile of scrolling around /home/anon/Documents/emoji.txt, this
reduces the runtime of Gfx::Emoji::emoji_for_code_points from 69.03% to
28.42%. Within that, Unicode::find_emoji_for_code_points reduces from
28.42% to just 1.95%.
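A hedged sketch of the ordering this relies on (the comparator is
illustrative, not the exact generated code):
#include <AK/Span.h>
#include <AK/Types.h>
// Order emoji first by the length of their code point sequence, then
// lexicographically by the code point values, so a binary search can
// compare a needle sequence against each candidate.
static int compare_code_points(Span<u32 const> lhs, Span<u32 const> rhs)
{
    if (lhs.size() != rhs.size())
        return lhs.size() < rhs.size() ? -1 : 1;
    for (size_t i = 0; i < lhs.size(); ++i) {
        if (lhs[i] != rhs[i])
            return lhs[i] < rhs[i] ? -1 : 1;
    }
    return 0;
}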
Similar to the FontDatabase, this will be needed for Ladybird to find
emoji images. We now generate just the file name of the emoji image
in LibUnicode and look for that file in the specified path
(defaulting to /res/emoji) at runtime.
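A hedged sketch of that runtime lookup (only the /res/emoji default
comes from this change; the variable and helper names are made up):
#include <AK/DeprecatedString.h>
#include <AK/StringView.h>
static DeprecatedString s_emoji_directory = "/res/emoji";
// LibUnicode only hands us a file name such as "U+1F600.png"; the
// embedder decides which directory to resolve it against.
static DeprecatedString emoji_path_for(StringView file_name)
{
    return DeprecatedString::formatted("{}/{}", s_emoji_directory, file_name);
}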
`Stream` will be qualified as `AK::Stream` until we remove the
`Core::Stream` namespace. `IODevice` now reuses the `SeekMode` that is
defined by `SeekableStream`, since defining its own would require us to
qualify it with `AK::SeekMode` everywhere.
Having an alias function that only wraps another one is silly, and
keeping the more obvious name should flush out more uses of deprecated
strings.
No behavior change.
Case folding rules have a similar mapping style to special casing rules,
where one code point may map to zero or more case folding rules. These
will be used for case-insensitive string comparisons. To see how case
folding can differ from other casing rules, consider "ß" (U+00DF):
>>> "ß".lower()
'ß'
>>> "ß".upper()
'SS'
>>> "ß".title()
'Ss'
>>> "ß".casefold()
'ss'
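A hedged sketch of the eventual use (casefold() here is a
hypothetical helper standing in for whatever LibUnicode ends up
exposing):
#include <AK/DeprecatedString.h>
#include <AK/StringView.h>
// Hypothetical helper; performs full case folding of the input.
DeprecatedString casefold(StringView);
bool equals_ignoring_case(StringView lhs, StringView rhs)
{
    // Case folding maps both "ß" and "SS" to "ss", so they compare
    // equal here; lowercasing alone would not do that, since
    // "ß".lower() is still "ß" while "SS".lower() is "ss".
    return casefold(lhs) == casefold(rhs);
}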
And remove links that aren't adding much value but will often get out of
date (i.e. links to UCD files, which are already all listed in
unicode_data.cmake).
These instances were detected by searching for files that include
AK/Format.h, but don't match the regex:
\\b(CheckedFormatString|critical_dmesgln|dbgln|dbgln_if|dmesgln|
FormatBuilder|__FormatIfSupported|FormatIfSupported|FormatParser|
FormatString|Formattable|Formatter|__format_value|HasFormatter|
max_format_arguments|out|outln|set_debug_enabled|StandardFormatter|
TypeErasedFormatParams|TypeErasedParameter|VariadicFormatParams|
v_critical_dmesgln|vdbgln|vdmesgln|vformat|vout|warn|warnln|
warnln_if)\\b
(Without the linebreaks.)
This regex is pessimistic, so there might be more files that don't
actually use any formatting functions.
Observe that this revealed that Userland/Libraries/LibC/signal.cpp is
missing an include.
In theory, one might use LibCPP to detect things like this
automatically, but let's do this one step at a time.
This generally seems like a better name, especially if we somehow also
need a better name for "read the entire buffer, but not the entire file"
somewhere down the line.
`Core::Stream::File` shouldn't hold any utility methods that are
unrelated to constructing a `Core::Stream`, so let's just replace the
existing `Core::File::exists` with the nicer looking implementation.
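For reference, a stat()-based exists() can be as small as this
sketch (not necessarily the exact code that was moved):
#include <sys/stat.h>
static bool exists(char const* path)
{
    struct stat st {};
    // The path exists if stat() succeeds; any failure (ENOENT, a
    // non-traversable parent, ...) is reported as "does not exist".
    return stat(path, &st) == 0;
}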
This will make it easier to support both string types at the same time
while we convert code, and to track down remaining uses.
One big exception is Value::to_string() in LibJS, where the name is
dictated by the ToString AO.
We have a new, improved string type coming up in AK (OOM aware, no null
state), and while it's going to use UTF-8, the name UTF8String is a
mouthful - so let's free up the String name by renaming the existing
class.
Making the old one have an annoying name will hopefully also help with
quick adoption :^)
Hand-picking the smallest index type that fits a particular generated
array started with commit 3ad159537e. This
was to reduce the size of the generated library.
Since then, the number of types using UniqueStorage has grown a ton,
creating a long list of types for which index types are manually picked.
When a new UCD/CLDR/TZDB is released, and the current index type no
longer fits the generated data, we fail to generate. Tracking down which
index caused the failure is a pretty annoying process.
Instead, we can just use size_t within the generators themselves, then
automatically pick the size needed for the generated code.
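A hedged sketch of the idea (this helper is illustrative, not the
actual generator code):
#include <AK/NumericLimits.h>
#include <AK/StringView.h>
#include <AK/Types.h>
// Pick the narrowest unsigned type that can hold every index of a
// generated array, instead of hard-coding u8/u16/u32 per array.
static StringView pick_index_type(size_t max_index)
{
    if (max_index <= NumericLimits<u8>::max())
        return "u8"sv;
    if (max_index <= NumericLimits<u16>::max())
        return "u16"sv;
    if (max_index <= NumericLimits<u32>::max())
        return "u32"sv;
    return "u64"sv;
}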
Previously each emoji had its own symbol in the library which was then
referred to by another symbol. This caused thousands of avoidable data
relocations at load time.
This saves about 122kB RAM for each process which uses LibUnicode.
Previously the s_decomposition_mappings variable would refer to other
data in s_decomposition_mappings_data. This would cause thousands of
avoidable relocations at load time.
This saves about 128kB RAM for each process which uses LibUnicode.
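A hedged sketch of the general pattern behind both of these changes
(the data here is illustrative):
#include <AK/Types.h>
// Before: every entry holds a pointer to another global, which the
// dynamic loader must patch (relocate) when the library is loaded.
static constexpr char const* s_names_with_relocations[] = {
    "copyright sign",
    "registered sign",
};
// After: one flat blob plus plain integer offsets into it; the data
// is position-independent and needs no relocations.
static constexpr char s_name_data[] = "copyright sign\0registered sign";
static constexpr u32 s_name_offsets[] = { 0, 15 };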
We currently have two build-time parsers for the UCD's emoji-test.txt
file. To prepare for future changes, this removes the Bash parser and
moves its functionality to the newer C++ parser.
The mappings are exposed via `Unicode::code_point_decomposition(u32)`
and `Unicode::code_point_decompositions()`, the latter being useful for
reverse searching a code point from its decomposition.
The normalization code does not make use of the `Quick_Check`
properties (https://www.unicode.org/reports/tr44/#Decompositions_and_Normalization),
so no quick-check optimizations are performed.
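A hedged usage sketch (only the function names above come from this
change; the Optional-like return type is an assumption, and the
include is omitted):
// Look up the canonical decomposition of U+00C5 (LATIN CAPITAL
// LETTER A WITH RING ABOVE).
auto decomposition = Unicode::code_point_decomposition(0x00C5);
if (decomposition.has_value()) {
    // U+00C5 canonically decomposes to U+0041 U+030A.
}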
Doesn't use them in libc headers so that those don't have to pull in
AK/Platform.h.
AK_COMPILER_GCC is set _only_ for gcc, not for clang too. (__GNUC__ is
defined in clang builds as well.) Using AK_COMPILER_GCC simplifies
things some.
AK_COMPILER_CLANG isn't as much of a win, other than that it's
consistent with AK_COMPILER_GCC.
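A minimal illustration of the difference (the branch bodies are
placeholders):
#include <AK/Platform.h>
#if defined(AK_COMPILER_GCC)
// Only real GCC takes this branch.
#elif defined(AK_COMPILER_CLANG)
// Clang takes this branch, even though it also defines __GNUC__,
// which a plain #ifdef __GNUC__ check would match as well.
#endif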
Without this, GenerateUnicodeData crashes when run during the build.
With this, `serenity.sh run` brings up a running SerenityOS.
Since GenerateUnicodeData doesn't take a lot of time to run, just
disable optimizations to work around the problem for now.
Works around #15449.
Parse emoji from emoji-serenity.txt to allow displaying their names and
grouping them together in the EmojiInputDialog.
This also adds an "Unknown" value to the EmojiGroup enum. This will be
useful for emoji that aren't found in the UCD, or for when UCD downloads
are disabled.
This allows us to find emoji data for files such as /res/emoji/U+A9.png.
U+00A9 is not fully-qualified (its full form is U+00A9 U+FE0F). But the
UCD has unqualified data for this code point; generating it allows us to
categorize these emoji appropriately in the EmojiInputDialog.
According to TR #51, the "best definition of the full set [of emojis] is
in the emoji-test.txt file". This defines not only the emoji themselves,
but the order in which they should be displayed, and what "group" of
emojis they belong to.
The UCD only cares about a few locales for special casing rules (az, lt,
and tr). Unfortunately, LibUnicode cannot use LibLocale once the
libraries are separate because LibLocale will need to use LibUnicode for
many more things; thus there would be a circular dependency. Instead,
just generate the small enum needed for this one use case.
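A hedged sketch of the kind of enum meant here (the enum and value
names are assumptions; only the three locales come from the UCD):
// Only locales with language-sensitive casing rules in the UCD need
// to be distinguished; everything else uses the default rules.
enum class CasingLocale {
    None,
    Azeri,      // az
    Lithuanian, // lt
    Turkish,    // tr
};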
To prepare for placing all CLDR generated data in a new library,
LibLocale, this moves the code generators for the CLDR data to the
LibLocale subfolder.
Since LibUnicode depends on this data, it used to include
Intl/AbstractOperations, which in turn includes a number of other
LibJS headers. By moving this to its own header with minimal
includes, we can save on rebuilding LibUnicode for unrelated LibJS
header changes.