0ct0pu5/ladybird

Author	SHA1	Message	Date
Idan Horowitz	be475cd6a8	Kernel: Handle OOM when adding memory regions to Spaces :^)	2021-07-15 00:49:41 +02:00
Andreas Kling	859e5741ff	Kernel: Fix Process use-after-free in Thread finalization We leak a ref() onto every user process when constructing them, either via Process::create_user_process(), or via Process::sys$fork(). This ref() is balanced by a corresponding unref() in Thread::WaitBlockCondition::finalize(). Since kernel processes don't have a leaked ref() on them, this led to an extra Process::unref() on kernel processes during finalization. This happened during every boot, with the `init_stage2` process. Found by turning off kfree() scrubbing. :^)	2021-07-14 22:36:29 +02:00
Brian Gianforcaro	84b4b9447d	Kernel: Move new process registration out of Space spinlock scope There appears to be no reason why the process registration needs to happen under the space spin lock. As the first thread is not started yet it should be completely uncontested, but it's still bad practice.	2021-07-12 10:20:21 +02:00
Liav A	12b6e69150	Kernel: Introduce the new ProcFS design The new ProcFS design consists of two main parts: 1. The representative ProcFS class, which is derived from the FS class. The ProcFS and its inodes are much more lean - merely 3 classes to represent the common type of inodes - regular files, symbolic links and directories. They're backed by a ProcFSExposedComponent object, which is responsible for the functional operation behind the scenes. 2. The backend of the ProcFS - the ProcFSComponentsRegistrar class and all derived classes from the ProcFSExposedComponent class. These together form the entire backend and handle all the functions you can expect from the ProcFS. The ProcFSExposedComponent derived classes split to 3 types in the manner of lifetime in the kernel: 1. Persistent objects - this category includes all basic objects, like the root folder, /proc/bus folder, main blob files in the root folders, etc. These objects are persistent and cannot die ever. 2. Semi-persistent objects - this category includes all PID folders, and subdirectories to the PID folders. It also includes exposed objects like the unveil JSON'ed blob. These objects are persistent as long as the responsible process they represent is still alive. 3. Dynamic objects - this category includes files in the subdirectories of a PID folder, like /proc/PID/fd/* or /proc/PID/stacks/*. Essentially, these objects are always created dynamically and when no longer in need after being used, they're deallocated. Nevertheless, the new allocated backend objects and inodes try to use the same InodeIndex if possible - this might change only when a thread dies and a new thread is born with a new thread stack, or when a file descriptor is closed and a new one within the same file descriptor number is opened. This is needed to actually be able to do something useful with these objects. The new design assures that many ProcFS instances can be used at once, with one backend for usage for all instances.	2021-06-29 20:53:59 +02:00
Gunnar Beutner	df9e73de25	Kernel: Add x86_64 support for fork()	2021-06-29 20:03:36 +02:00
Gunnar Beutner	2a78bf8596	Kernel: Fix the return type for syscalls The Process::Handler type has KResultOr<FlatPtr> as its return type. Using a different return type with an equally-sized template parameter sort of works but breaks once that condition is no longer true, e.g. for KResultOr<int> on x86_64. Ideally the syscall handlers would also take FlatPtrs as their args so we can get rid of the reinterpret_cast for the function pointer but I didn't quite feel like cleaning that up as well.	2021-06-28 22:29:28 +02:00
Gunnar Beutner	f285241cb8	Kernel: Rename Thread::tss to Thread::regs and add x86_64 support We're using software context switches so calling this struct tss is somewhat misleading.	2021-06-27 15:46:42 +02:00
Gunnar Beutner	38fca26f54	Kernel: Add stubs for missing x86_64 functionality This adds just enough stubs to make the kernel compile on x86_64. Obviously it won't do anything useful - in fact it won't even attempt to boot because Multiboot doesn't support ELF64 binaries - but it gets those compiler errors out of the way so more progress can be made getting all the missing functionality in place.	2021-06-24 09:27:13 +02:00
Brian Gianforcaro	9fccbde371	Kernel: Switch Process to IntrusiveList from InlineLinkedList	2021-06-07 09:42:55 +02:00
Brian Gianforcaro	ede1483e48	Kernel: Make Process creation APIs OOM safe This change looks more involved than it actually is. This simply reshuffles the previous Process constructor and splits out the parts which can fail (resource allocation) into separate methods which can be called from a factory method. The factory is then used everywhere instead of the constructor.	2021-05-15 09:01:32 +02:00
Brian Gianforcaro	8bf4201f50	Kernel: Move process creation perf events to PerformanceManager	2021-05-07 15:35:23 +02:00
Gunnar Beutner	eb798d5538	Kernel+Profiler: Improve profiling subsystem This turns the perfcore format into more a log than it was before, which lets us properly log process, thread and region creation/destruction. This also makes it unnecessary to dump the process' regions every time it is scheduled like we did before. Incidentally this also fixes 'profile -c' because we previously ended up incorrectly dumping the parent's region map into the profile data. Log-based mmap support enables profiling shared libraries which are loaded at runtime, e.g. via dlopen(). This enables profiling both the parent and child process for programs which use execve(). Previously we'd discard the profiling data for the old process. The Profiler tool has been updated to not treat thread IDs as process IDs anymore. This enables support for processes with more than one thread. Also, there's a new widget to filter which process should be displayed.	2021-04-26 17:13:55 +02:00
Andreas Kling	b91c49364d	AK: Rename adopt() to adopt_ref() This makes it more symmetrical with adopt_own() (which is used to create a NonnullOwnPtr from the result of a naked new.)	2021-04-23 16:46:57 +02:00
Brian Gianforcaro	1682f0b760	Everything: Move to SPDX license identifiers in all files. SPDX License Identifiers are a more compact / standardized way of representing file license information. See: https://spdx.dev/resources/use/#identifiers This was done with the `ambr` search and replace tool. ambr --no-parent-ignore --key-from-file --rep-from-file key.txt rep.txt *	2021-04-22 11:22:27 +02:00
Idan Horowitz	2c93123daf	Kernel: Replace process' regions vector with a Red Black tree This should provide some speed up, as currently searches for regions containing a given address were performed in O(n) complexity, while this container allows us to do those in O(log n).	2021-04-12 18:03:44 +02:00
Andreas Kling	49a0f40ff0	Kernel: Inherit the dumpable flag on sys$fork() This regressed at some point recently. All children were non-dumpable until manually opting into it.	2021-03-11 14:35:37 +01:00
Andreas Kling	b7b7a48c66	Kernel: Move process signal trampoline address into protected data	2021-03-11 14:21:49 +01:00
Andreas Kling	08e0e2eb41	Kernel: Move process umask into protected data :^)	2021-03-11 14:21:49 +01:00
Andreas Kling	90c0f9664e	Kernel: Don't keep protected Process data in a separate allocation The previous architecture had a huge flaw: the pointer to the protected data was itself unprotected, allowing you to overwrite it at any time. This patch reorganizes the protected data so it's part of the Process class itself. (Actually, it's a new ProcessBase helper class.) We use the first 4 KB of Process objects themselves as the new storage location for protected data. Then we make Process objects page-aligned using MAKE_ALIGNED_ALLOCATED. This allows us to easily turn on/off write-protection for everything in the ProcessBase portion of Process. :^) Thanks to @bugaevc for pointing out the flaw! This is still not perfect but it's an improvement.	2021-03-11 14:21:49 +01:00
Andreas Kling	de6c5128fd	Kernel: Move process pledge promises into protected data	2021-03-10 22:50:00 +01:00
Andreas Kling	d677a73b0e	Kernel: Move process extra_gids into protected data :^)	2021-03-10 22:30:02 +01:00
Andreas Kling	cbcf891040	Kernel: Move select Process members into protected memory Process member variables like m_euid are very valuable targets for kernel exploits and until now they have been writable at all times. This patch moves m_euid along with a whole bunch of other members into a new Process::ProtectedData struct. This struct is remapped as read-only memory whenever we don't need to write to it. This means that a kernel write primitive is no longer enough to overwrite a process's effective UID, you must first unprotect the protected data where the UID is stored. :^)	2021-03-10 22:30:02 +01:00
Andreas Kling	a819eb5016	Kernel: Skip TLB flushes while cloning regions in sys$fork() Since we know for sure that the virtual memory regions in the new process being created are not being used on any CPU, there's no need to do TLB flushes for every mapped page.	2021-03-03 22:57:45 +01:00
Andreas Kling	ac71775de5	Kernel: Make all syscall functions return KResultOr<T> This makes it a lot easier to return errors since we no longer have to worry about negating EFOO errors and can just return them flat.	2021-03-01 13:54:32 +01:00
Andreas Kling	5a595ef134	Kernel: Use dbgln_if() in sys$fork()	2021-02-17 15:34:32 +01:00
Andreas Kling	68e3616971	Kernel: Forked children should inherit the signal trampoline address Fixes #5347.	2021-02-14 18:38:46 +01:00
Andreas Kling	f1b5def8fd	Kernel: Factor address space management out of the Process class This patch adds Space, a class representing a process's address space. - Each Process has a Space. - The Space owns the PageDirectory and all Regions in the Process. This allows us to reorganize sys$execve() so that it constructs and populates a new Space fully before committing to it. Previously, we would construct the new address space while still running in the old one, and encountering an error meant we had to do tedious and error-prone rollback. Those problems are now gone, replaced by what's hopefully a set of much smaller problems and missing cleanups. :^)	2021-02-08 18:27:28 +01:00
AnotherTest	09a43969ba	Everywhere: Replace dbgln<flag>(...) with dbgln_if(flag, ...) Replacement made by `find Kernel Userland -name '.h' -o -name '.cpp' \| sed -i -Ee 's/dbgln\b<(\w+)>\(/dbgln_if(\1, /g'`	2021-02-08 18:08:55 +01:00
Andreas Kling	823186031d	Kernel: Add a way to specify which memory regions can make syscalls This patch adds sys$msyscall() which is loosely based on an OpenBSD mechanism for preventing syscalls from non-blessed memory regions. It works similarly to pledge and unveil, you can call it as many times as you like, and when you're finished, you call it with a null pointer and it will stop accepting new regions from then on. If a syscall later happens and doesn't originate from one of the previously blessed regions, the kernel will simply crash the process.	2021-02-02 20:13:44 +01:00
asynts	7cf0c7cc0d	Meta: Split debug defines into multiple headers. The following script was used to make these changes: #!/bin/bash set -e tmp=$(mktemp -d) echo "tmp=$tmp" find Kernel $ -name '.cpp' -o -name '.h' $ \| sort > $tmp/Kernel.files find . $ -path ./Toolchain -prune -o -path ./Build -prune -o -path ./Kernel -prune $ -o $ -name '.cpp' -o -name '.h' $ -print \| sort > $tmp/EverythingExceptKernel.files cat $tmp/Kernel.files \| xargs grep -Eho '[A-Z0-9_]+_DEBUG' \| sort \| uniq > $tmp/Kernel.macros cat $tmp/EverythingExceptKernel.files \| xargs grep -Eho '[A-Z0-9_]+_DEBUG' \| sort \| uniq > $tmp/EverythingExceptKernel.macros comm -23 $tmp/Kernel.macros $tmp/EverythingExceptKernel.macros > $tmp/Kernel.unique comm -1 $tmp/Kernel.macros $tmp/EverythingExceptKernel.macros > $tmp/EverythingExceptKernel.unique cat $tmp/Kernel.unique \| awk '{ print "#cmakedefine01 "$1 }' > $tmp/Kernel.header cat $tmp/EverythingExceptKernel.unique \| awk '{ print "#cmakedefine01 "$1 }' > $tmp/EverythingExceptKernel.header for macro in $(cat $tmp/Kernel.unique) do cat $tmp/Kernel.files \| xargs grep -l $macro >> $tmp/Kernel.new-includes \|\|: done cat $tmp/Kernel.new-includes \| sort > $tmp/Kernel.new-includes.sorted for macro in $(cat $tmp/EverythingExceptKernel.unique) do cat $tmp/Kernel.files \| xargs grep -l $macro >> $tmp/Kernel.old-includes \|\|: done cat $tmp/Kernel.old-includes \| sort > $tmp/Kernel.old-includes.sorted comm -23 $tmp/Kernel.new-includes.sorted $tmp/Kernel.old-includes.sorted > $tmp/Kernel.includes.new comm -13 $tmp/Kernel.new-includes.sorted $tmp/Kernel.old-includes.sorted > $tmp/Kernel.includes.old comm -12 $tmp/Kernel.new-includes.sorted $tmp/Kernel.old-includes.sorted > $tmp/Kernel.includes.mixed for file in $(cat $tmp/Kernel.includes.new) do sed -i -E 's/#include <AK\/Debug\.h>/#include <Kernel\/Debug\.h>/' $file done for file in $(cat $tmp/Kernel.includes.mixed) do echo "mixed include in $file, requires manual editing." done	2021-01-26 21:20:00 +01:00
Andreas Kling	c7858622ec	Kernel: Update process promise states on execve() and fork() We now move the execpromises state into the regular promises, and clear the execpromises state. Also make sure to duplicate the promise state on fork. This fixes an issue where "su" would launch a shell which immediately crashed due to not having pledged "stdio".	2021-01-26 15:26:37 +01:00
asynts	8465683dcf	Everywhere: Debug macros instead of constexpr. This was done with the following script: find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/' -exec sed -i -E 's/dbgln<debug_([a-z_]+)>/dbgln<\U\1_DEBUG>/' {} \; find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/' -exec sed -i -E 's/if constexpr \(debug_([a-z0-9_]+)/if constexpr \(\U\1_DEBUG/' {} \;	2021-01-25 09:47:36 +01:00
asynts	1a3a0836c0	Everywhere: Use CMake to generate AK/Debug.h. This was done with the help of several scripts, I dump them here to easily find them later: awk '/#ifdef/ { print "#cmakedefine01 "$2 }' AK/Debug.h.in for debug_macro in $(awk '/#ifdef/ { print $2 }' AK/Debug.h.in) do find . $ -name '.cpp' -o -name '.h' -o -name '.in' $ -not -path './Toolchain/' -not -path './Build/*' -exec sed -i -E 's/#ifdef '$debug_macro'/#if '$debug_macro'/' {} \; done # Remember to remove WRAPPER_GERNERATOR_DEBUG from the list. awk '/#cmake/ { print "set("$2" ON)" }' AK/Debug.h.in	2021-01-25 09:47:36 +01:00
asynts	7b0a1a98d9	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect.	2021-01-22 22:14:30 +01:00
Andreas Kling	bf0719092f	Kernel+Userland: Remove shared buffers (shbufs) All users of this mechanism have been switched to anonymous files and passing file descriptors with sendfd()/recvfd(). Shbufs got us where we are today, but it's time we say good-bye to them and welcome a much more idiomatic replacement. :^)	2021-01-17 09:07:32 +01:00
asynts	938e5c7719	Everywhere: Replace a bundle of dbg with dbgln. These changes are arbitrarily divided into multiple commits to make it easier to find potentially introduced bugs with git bisect. Everything: The modifications in this commit were automatically made using the following command: find . -name '.cpp' -exec sed -i -E 's/dbg << ("[^"{]");/dbgln$\1$;/' {} \;	2021-01-09 21:11:09 +01:00
Tom	2f429bd2d5	Kernel: Pass new region owner to Region::clone	2021-01-01 23:43:44 +01:00
Tom	476f17b3f1	Kernel: Merge PurgeableVMObject into AnonymousVMObject This implements memory commitments and lazy-allocation of committed memory.	2021-01-01 23:43:44 +01:00
Tom	b2a52f6208	Kernel: Implement lazy committed page allocation By designating a committed page pool we can guarantee to have physical pages available for lazy allocation in mappings. However, when forking we will overcommit. The assumption is that worst-case it's better for the fork to die due to insufficient physical memory on COW access than the parent that created the region. If a fork wants to ensure that all memory is available (trigger a commit) then it can use madvise. This also means that fork now can gracefully fail if we don't have enough physical pages available.	2021-01-01 23:43:44 +01:00
AnotherTest	a9184fcb76	Kernel: Implement unveil() as a prefix-tree Fixes #4530.	2020-12-26 11:54:54 +01:00
Andreas Kling	ed5c26d698	AK: Remove custom %w format string specifier This was a non-standard specifier alias for %04x. This patch replaces all uses of it with new-style formatting functions instead.	2020-12-25 17:05:05 +01:00
Tom	a89648e159	Kernel: Inherit shared buffers when forking We need to create a reference for the new PID for each shared buffer that the process had a reference to. If the process subsequently gets replaced through exec, those references will be dropped again. But if exec for some reason fails then other code, such as global destructors, could still expect having access to them. Fixes #4076	2020-11-23 09:39:32 +01:00
Tom	75f61fe3d9	AK: Make RefPtr, NonnullRefPtr, WeakPtr thread safe This makes most operations thread safe, especially so that they can safely be used in the Kernel. This includes obtaining a strong reference from a weak reference, which now requires an explicit call to WeakPtr::strong_ref(). Another major change is that Weakable::make_weak_ref() may require the explicit target type. Previously we used reinterpret_cast in WeakPtr, assuming that it can be properly converted. But WeakPtr does not necessarily have the knowledge to be able to do this. Instead, we now ask the class itself to deliver a WeakPtr to the type that we want. Also, WeakLink is no longer specific to a target type. The reason for this is that we want to be able to safely convert e.g. WeakPtr<T> to WeakPtr<U>, and before this we just reinterpret_cast the internal WeakLink<T> to WeakLink<U>, which is a bold assumption that it would actually produce the correct code. Instead, WeakLink now operates on just a raw pointer and we only make those constructors/operators available if we can verify that it can be safely cast. In order to guarantee thread safety, we now use the least significant bit in the pointer for locking purposes. This also means that only properly aligned pointers can be used.	2020-11-10 19:11:52 +01:00
Tom	1e2e3eed62	Kernel: Fix a few deadlocks with Thread::m_lock and g_scheduler_lock g_scheduler_lock cannot safely be acquired after Thread::m_lock because another processor may already hold g_scheduler_lock and wait for the same Thread::m_lock.	2020-10-26 08:57:25 +01:00
Tom	838d9fa251	Kernel: Make Thread refcounted Similar to Process, we need to make Thread refcounted. This will solve problems that will appear once we schedule threads on more than one processor. This allows us to hold onto threads without necessarily holding the scheduler lock for the entire duration.	2020-09-27 19:46:04 +02:00
Tom	0fab0ee96a	Kernel: Rename Process::is_ring0/3 to Process::is_kernel/user_process Since "rings" typically refer to code execution and user processes can also execute in ring 0, rename these functions to more accurately describe what they mean: kernel processes and user processes.	2020-09-10 19:57:15 +02:00
Tom	c3d231616c	Kernel: Fix crash when delivering signal to barely created thread We need to wait until a thread is fully set up and ready for running before attempting to deliver a signal. Otherwise we may not have a user stack yet. Also, remove the Skip0SchedulerPasses and Skip1SchedulerPass thread states that we don't really need anymore with software context switching. Fixes the kernel crash reported in #3419	2020-09-07 16:49:19 +02:00
AnotherTest	688e54eac7	Kernel: Distinguish between new and old process groups with equal pgids This does not add any behaviour change to the processes, but it ties a TTY to an active process group via TIOCSPGRP, and returns the TTY to the kernel when all processes in the process group die. Also makes the TTY keep a link to the original controlling process' parent (for SIGCHLD) instead of the process itself.	2020-08-19 21:21:34 +02:00
Ben Wiederhake	f5744a6f2f	Kernel: PID/TID typing This compiles, and contains exactly the same bugs as before. The regex 'FIXME: PID/' should reveal all markers that I left behind, including: - Incomplete conversion - Issues or things that look fishy - Actual bugs that will go wrong during runtime	2020-08-10 11:51:45 +02:00
Andreas Kling	949aef4aef	Kernel: Move syscall implementations out of Process.cpp This is something I've been meaning to do for a long time, and here we finally go. This patch moves all sys$foo functions out of Process.cpp and into files in Kernel/Syscalls/. It's not exactly one syscall per file (although it could be, but I got a bit tired of the repetitive work here..) This makes hacking on individual syscalls a lot less painful since you don't have to rebuild nearly as much code every time. I'm also hopeful that this makes it easier to understand individual syscalls. :^)	2020-07-30 23:40:57 +02:00

50 commits