It's now up to the caller to provide a VMObject when constructing a new
Region object. This will make it easier to handle things going wrong,
like allocation failures, etc.
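A minimal sketch of the new construction pattern (Region::create()
and the exact signatures here are illustrative, not the real API):

    // The caller allocates the VMObject first, so an allocation
    // failure can be handled before a Region ever exists.
    auto vmobject = AnonymousVMObject::create_with_size(size);
    if (!vmobject)
        return nullptr; // bail out cleanly
    auto region = Region::create(range, vmobject.release_nonnull());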
When forking a process, we now turn all of the private inode-backed
mmap() regions into copy-on-write regions in both the parent and child.
This patch also removes an assertion that becomes irrelevant.
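Sketched, the conversion works something like this (set_should_cow()
stands in for the actual mechanism):

    // On fork, mark every page of a private inode-backed region as
    // CoW in both address spaces, then remap read-only so the first
    // write from either side takes a copy-on-write fault.
    for (size_t i = 0; i < region.page_count(); ++i) {
        parent_region.set_should_cow(i, true);
        child_region.set_should_cow(i, true);
    }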
We don't have to log the process name/PID/TID, dbg() automatically adds
that as a prefix to every line.
Also we don't have to do .characters() on Strings passed to dbg() :^)
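For illustration, the before/after looks roughly like this:

    // Before:
    dbg() << *current << " open: " << path.characters();
    // After: the prefix comes for free, and Strings stream directly.
    dbg() << "open: " << path;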
We now have PrivateInodeVMObject and SharedInodeVMObject, corresponding
to MAP_PRIVATE and MAP_SHARED respectively.
Note that PrivateInodeVMObject is not used yet.
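From userspace, the split lines up with the two standard mmap()
mapping types:

    int fd = open("/home/anon/file.txt", O_RDWR);
    // Backed by a SharedInodeVMObject: writes are carried through
    // to the underlying inode.
    void* shared = mmap(nullptr, 4096, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    // Will be backed by a PrivateInodeVMObject once it's hooked up:
    // a private, copy-on-write view of the file.
    void* priv = mmap(nullptr, 4096, PROT_READ | PROT_WRITE, MAP_PRIVATE, fd, 0);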
This patch adds a globally shared zero-filled PhysicalPage that will
be mapped into every slot of every zero-filled AnonymousVMObject until
that page is written to, achieving CoW-like zero-filled pages.
Initial testing shows that this doesn't actually achieve any sharing yet,
but it seems like a good design regardless, since it may reduce the
number of page faults taken by programs.
If you look at the refcount of MM.shared_zero_page() it will have quite
a high refcount, but that's just because everything maps it everywhere.
If you want to see the "real" refcount, you can build with the
MAP_SHARED_ZERO_PAGE_LAZILY flag, and we'll defer mapping of the shared
zero page until the first not-present (NP) read fault.
I've left this behavior behind a flag for future testing of this code.
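The write path can be sketched like this (helper names are
illustrative):

    // Write fault against a slot that still maps the shared zero
    // page: give the slot its own private page. No copying is needed,
    // since the replacement page starts out zero-filled anyway.
    if (physical_pages()[page_index] == MM.shared_zero_page()) {
        auto new_page = MM.allocate_user_physical_page();
        if (new_page.is_null())
            return PageFaultResponse::ShouldCrash; // out of physical pages
        physical_pages()[page_index] = move(new_page);
        remap_page(page_index); // now writable and private
    }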
It doesn't look healthy to create raw references into an array before
a temporary unlock. In fact, that temporary unlock looks generally
unhealthy, but it's a different problem.
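The pattern in question, sketched (names illustrative):

    auto& entry = m_entries[index]; // raw reference into the array
    m_lock.unlock();                // temporary unlock
    do_something_slow();            // m_entries may grow or shrink here...
    m_lock.lock();
    use(entry);                     // ...leaving `entry` dangling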
When using dbg() in the kernel, the output is automatically prefixed
with [Process(PID:TID)]. This makes it a lot easier to understand which
thread is generating the output.
This patch also cleans up some common logging messages and removes the
now-unnecessary "dbg() << *current << ..." pattern.
There is no real "read protection" on x86, so we have no choice but to
map write-only pages simply as "present & read/write".
If we get a read page fault in a non-readable region, that's still a
correctness issue, so we crash the process. It's by no means a complete
protection against invalid reads, since it's trivial to fool the kernel
by first causing a write fault in the same region.
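In page-table terms, the mapping logic boils down to something like
this (the PTE accessors are illustrative):

    // x86 PTEs only have "present" and "read/write" bits, so any
    // writable mapping is unavoidably readable as well.
    pte.set_present(region.is_readable() || region.is_writable());
    pte.set_writable(region.is_writable());
    // A write-only region thus ends up mapped "present & read/write".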
This will panic the kernel immediately if these functions are misused
so we can catch it and fix the misuse.
This patch fixes a couple of misuses:
- create_signal_trampolines() writes to a user-accessible page
above the 3GB address mark. We should really get rid of this
page but that's a whole other thing.
- CoW faults need to use copy_from_user rather than copy_to_user
since it's the *source* pointer that points to user memory.
- Inode faults need to use memcpy rather than copy_to_user since
we're copying a kernel stack buffer into a quickmapped page.
This should make the copy_to/from_user() functions slightly less useful
for exploitation. Before this, they were essentially just glorified
memcpy() with SMAP disabled. :^)
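Sketched, the checked versions amount to something like this
(is_user_range() stands in for whatever validation is actually done):

    void copy_to_user(void* dest, const void* src, size_t n)
    {
        // Panic immediately if `dest` isn't entirely user memory,
        // so misuses like the ones above are caught right away.
        ASSERT(is_user_range(VirtualAddress(dest), n));
        SmapDisabler disabler;
        memcpy(dest, src, n);
    }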
As suggested by Joshua, this commit adds the 2-clause BSD license as a
comment block to the top of every source file.
For the first pass, I've just added myself for simplicity. I encourage
everyone to add themselves as copyright holders of any file they've
added or modified in some significant way. If I've added myself in
error somewhere, feel free to replace it with the appropriate copyright
holder instead.
Going forward, all new source files should include a license header.
The kernel and its static data structures are no longer identity-mapped
in the bottom 8MB of the address space, but now live above 3GB.
The first 8MB above 3GB are pseudo-identity-mapped to the bottom 8MB of
the physical address space. But things don't have to stay this way!
Thanks to Jesse, who made an earlier attempt at this, it was really easy
to get device drivers working once the page tables were in place! :^)
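With the pseudo-identity mapping in place, translating between the
two address spaces is just a fixed offset (sketch):

    // Physical addresses in the bottom 8MB are visible at +3GB.
    inline u32 physical_to_virtual(u32 paddr) { return paddr + 0xc0000000; }
    inline u32 virtual_to_physical(u32 vaddr) { return vaddr - 0xc0000000; }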
Fixes #734.
We can now create a cacheable Region: when map() is called on a
cacheable Region, all of the virtual memory space allocated to it is
mapped with caching enabled, i.e. not marked as cache-disabled.
In addition to that, OS components can create a Region that will be
mapped to a specific physical address by using the appropriate helper
method.
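A sketch of what a call site might look like (the helper name and
signature here are hypothetical):

    // Map a device's MMIO range at its physical address, with
    // caching disabled, e.g. for a framebuffer:
    auto region = MM.allocate_kernel_region(
        PhysicalAddress(0xfd000000), framebuffer_size, "Framebuffer",
        Region::Access::Read | Region::Access::Write,
        false /* cacheable */);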
When loading a new executable, we now map the ELF image in kernel-only
memory and parse it there. Then we use copy_to_user() when initializing
writable regions with data from the executable.
Note that the exec() syscall still disables SMAP protection and will
require additional work. This patch only affects kernel-originated
process spawns.
Supervisor Mode Access Prevention (SMAP) is an x86 CPU feature that
prevents the kernel from accessing userspace memory. With SMAP enabled,
trying to read/write a userspace memory address while in the kernel
will now generate a page fault.
Since it's sometimes necessary to read/write userspace memory, there
are two new instructions that quickly switch the protection on/off:
STAC (disables protection) and CLAC (enables protection.)
These are exposed in kernel code via the stac() and clac() helpers.
There's also a SmapDisabler RAII object that can be used to ensure
that you don't forget to re-enable protection before returning to
userspace code.
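A minimal sketch of these helpers:

    inline void stac() { asm volatile("stac" ::: "cc"); } // allow user memory access
    inline void clac() { asm volatile("clac" ::: "cc"); } // disallow it again

    class SmapDisabler {
    public:
        SmapDisabler() { stac(); }
        ~SmapDisabler() { clac(); }
    };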
This patch also adds copy_to_user(), copy_from_user() and memset_user()
which are the "correct" way of doing things. These functions allow us
to briefly disable protection for a specific purpose, and then turn it
back on immediately after it's done. Going forward, all kernel code
should be moved to using these, and all remaining uses of SmapDisabler
are to be considered FIXMEs.
Note that we're not realizing the full potential of this feature since
I've used SmapDisabler quite liberally in this initial bring-up patch.
We now refuse to boot on machines that don't support PAE since all
of our paging code depends on it.
Also, let's only enable SSE and PGE support if the CPU advertises them.
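All of these feature bits come from CPUID leaf 1 (EDX bit 6 = PAE,
bit 13 = PGE, bit 25 = SSE); sketch, with hang() standing in for
however we actually refuse to boot:

    u32 eax = 1, ebx, ecx, edx;
    asm volatile("cpuid" : "+a"(eax), "=b"(ebx), "=c"(ecx), "=d"(edx));
    if (!(edx & (1 << 6)))
        hang(); // no PAE, refuse to boot
    bool has_pge = edx & (1 << 13);
    bool has_sse = edx & (1 << 25);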
Dirty private memory is all memory in non-inode-backed mappings that's
process-private, meaning it's not shared with any other process.
This patch exposes that number via SystemMonitor, giving us an idea of
how much memory each process is responsible for all on its own.
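The accounting boils down to something like this (method names are
illustrative):

    size_t Process::amount_dirty_private() const
    {
        size_t amount = 0;
        for (auto& region : m_regions) {
            // Only count non-shared, non-inode-backed mappings.
            if (!region.is_shared() && !region.vmobject().is_inode())
                amount += region.amount_dirty();
        }
        return amount;
    }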
Previously we assumed all hosts would have support for IA32_EFER.NXE.
This is mostly true for newer hardware, but older hardware will crash
and burn if you try to use this feature.
Now we check for support via CPUID.80000001[20].
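Sketch of the check:

    // CPUID extended leaf 0x80000001: EDX bit 20 advertises NX.
    u32 eax = 0x80000001, ebx, ecx, edx;
    asm volatile("cpuid" : "+a"(eax), "=b"(ebx), "=c"(ecx), "=d"(edx));
    bool has_nx = edx & (1 << 20);
    // Only set IA32_EFER.NXE if has_nx is true.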
Setting the CR0.WP bit will cause the CPU to generate a page fault when
writing to read-only memory, even if we're executing in the kernel.
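Sketch of flipping the bit:

    // Set CR0.WP (bit 16): supervisor writes now honor read-only PTEs.
    asm volatile(
        "mov %%cr0, %%eax\n"
        "orl $0x00010000, %%eax\n"
        "mov %%eax, %%cr0" ::: "eax");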
Seemingly the only change needed to make this work was to have the
inode-backed page fault handler use a temporary mapping for writing
the read-from-disk data into the newly-allocated physical page.
Every process keeps its own ELF executable mapped in memory in case we
need to do symbol lookup (for backtraces, etc.)
Until now, it was mapped in a way that made it accessible to the
program, despite the program not having mapped it itself.
I don't really see a need for userspace to have access to this right
now, so let's lock things down a little bit.
This patch makes it inaccessible to userspace and exposes that fact
through /proc/PID/vm (per-region "user_accessible" flag.)
It's now possible to get purgeable memory by using mmap(MAP_PURGEABLE).
Purgeable memory has a "volatile" flag that can be set using madvise():
- madvise(..., MADV_SET_VOLATILE)
- madvise(..., MADV_SET_NONVOLATILE)
When in the "volatile" state, the kernel may take away the underlying
physical memory pages at any time, without notifying the owner.
This gives you a guilt discount when caching very large things. :^)
Setting a purgeable region back to non-volatile returns whether or
not the kernel took the memory away while it was volatile.
Basically, if madvise(..., MADV_SET_NONVOLATILE) returns 1, that means
the memory was purged while volatile, and whatever was in that piece
of memory needs to be reconstructed before use.
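Usage, sketched from the description above (cache_size is a
placeholder):

    void* cache = mmap(nullptr, cache_size, PROT_READ | PROT_WRITE,
                       MAP_ANONYMOUS | MAP_PRIVATE | MAP_PURGEABLE, -1, 0);
    // ...fill the cache...
    madvise(cache, cache_size, MADV_SET_VOLATILE); // kernel may purge it now
    // Later, when the cache is needed again:
    if (madvise(cache, cache_size, MADV_SET_NONVOLATILE) == 1) {
        // Contents were purged while volatile; rebuild before use.
    }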
This patch makes it possible to make memory regions non-readable.
This is enforced using the "present" bit in the page tables.
A process that hits a not-present page fault in a non-readable
region will be crashed.
The kernel is now no longer identity mapped to the bottom 8MiB of
memory, and is now mapped at the higher address of `0xc0000000`.
The lower ~1MiB of memory (from GRUB's mmap), however, is still
identity-mapped to provide an easy way for the kernel to get
physical pages for things such as DMA, etc. These could later be
mapped to the higher address too, as I'm not too sure how to
go about doing this elegantly without a lot of address subtractions.
VM regions can now be marked as stack regions, which is then validated
on syscall, and on page fault.
If a thread is caught with its stack pointer pointing into anything
that's *not* a Region with its stack bit set, we'll crash the whole
process with SIGSTKFLT.
Userspace must now allocate custom stacks by using mmap() with the new
MAP_STACK flag. This mechanism was first introduced in OpenBSD, and now
we have it too, yay! :^)
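For example, allocating a custom thread stack now looks like this
(size illustrative):

    void* stack = mmap(nullptr, 65536, PROT_READ | PROT_WRITE,
                       MAP_ANONYMOUS | MAP_PRIVATE | MAP_STACK, -1, 0);
    // A thread whose stack pointer strays outside stack-bit regions
    // gets the whole process killed with SIGSTKFLT.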
After the page fault handler has found the region in which the fault
occurred, do the rest of the work in the region itself.
This patch also makes all fault types consistently crash the process
if a new page is needed but we're all out of pages.
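The resulting dispatch looks roughly like this (names are close to,
but not guaranteed to match, the real ones):

    PageFaultResponse Region::handle_fault(const PageFault& fault)
    {
        auto page_index = page_index_from_address(fault.vaddr());
        if (fault.type() == PageFault::Type::PageNotPresent) {
            if (vmobject().is_inode())
                return handle_inode_fault(page_index);
            return handle_zero_fault(page_index); // may crash if out of pages
        }
        if (should_cow(page_index))
            return handle_cow_fault(page_index);
        return PageFaultResponse::ShouldCrash;
    }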