0ct0pu5/ladybird

Author	SHA1	Message	Date
Andreas Kling	0c44a12247	Kernel: read() and write() should EOVERFLOW if (offset+size) overflows	2020-01-12 20:20:17 +01:00
Andreas Kling	14d4b1058e	Kernel: Add a basic lock to FileDescription Let's prevent two processes sharing a FileDescription from messing with it at the same time for now.	2020-01-12 20:09:44 +01:00
Sergey Bugaev	33c0dc08a7	Kernel: Don't forget to copy & destroy root_directory_for_procfs Also, rename it to root_directory_relative_to_global_root.	2020-01-12 20:02:11 +01:00
Sergey Bugaev	fee6d0a3a6	Kernel+Base: Mount root as nodev,nosuid Then bind-mount /dev and /bin while adding back the appropriate permissions :^)	2020-01-12 20:02:11 +01:00
Sergey Bugaev	93ff911473	Kernel: Properly propagate bind mount flags Previously, when performing a bind mount, flags other than MS_BIND were ignored. Now, they're properly propagated the same way as for any other mount.	2020-01-12 20:02:11 +01:00
Sergey Bugaev	3393b78623	Kernel: Allow getting a Device from a FileDescription Like we already do for other kinds of files.	2020-01-12 20:02:11 +01:00
Andreas Kling	cb59f9e0f2	Kernel: Put some VFS debug spam behind VFS_DEBUG	2020-01-12 10:01:22 +01:00
Andreas Kling	b36608f47c	ProcFS: Expose process pledge promises in /proc/all	2020-01-11 21:33:12 +01:00
Sergey Bugaev	0cb0f54783	Kernel: Implement bind mounts You can now bind-mount files and directories. This essentially exposes an existing part of the file system in another place, and can be used as an alternative to symlinks or hardlinks. Here's an example of doing this: # mkdir /tmp/foo # mount /home/anon/myfile.txt /tmp/foo -o bind # cat /tmp/foo This is anon's file.	2020-01-11 18:57:53 +01:00
Sergey Bugaev	61c1106d9f	Kernel+LibC: Implement a few mount flags We now support these mount flags: * MS_NODEV: disallow opening any devices from this file system * MS_NOEXEC: disallow executing any executables from this file system * MS_NOSUID: ignore set-user-id bits on executables from this file system The fourth flag, MS_BIND, is defined, but currently ignored.	2020-01-11 18:57:53 +01:00
Sergey Bugaev	2fcbb846fb	Kernel+LibC: Add O_EXEC, move exec permission checking to VFS::open() O_EXEC is mentioned by POSIX, so let's have it. Currently, it is only used inside the kernel to ensure the process has the right permissions when opening an executable.	2020-01-11 18:57:53 +01:00
Sergey Bugaev	4566c2d811	Kernel+LibC: Add support for mount flags At the moment, the actual flags are ignored, but we correctly propagate them all the way from the original mount() syscall to each custody that resides on the mounted FS.	2020-01-11 18:57:53 +01:00
Sergey Bugaev	1e6ab0ed22	Kernel: Simplify VFS::Mount handling No need to pass around RefPtr<>s and NonnullRefPtr<>s and no need to heap-allocate them. Also remove VFS::mount(NonnullRefPtr<FS>&&, StringView path) - it has been unused for a long time.	2020-01-11 18:57:53 +01:00
Andreas Kling	29b3d95004	Kernel: Expose a process's filesystem root as a /proc/PID/root symlink In order to preserve the absolute path of the process root, we save the custody used by chroot() before stripping it to become the new "/". There's probably a better way to do this.	2020-01-10 23:48:44 +01:00
Andreas Kling	ddd0b19281	Kernel: Add a basic chroot() syscall :^) The chroot() syscall now allows the superuser to isolate a process into a specific subtree of the filesystem. This is not strictly permanent, as it is also possible for a superuser to break out of a chroot, but it is a useful mechanism for isolating unprivileged processes. The VFS now uses the current process's root_directory() as the root for path resolution purposes. The root directory is stored as an uncached Custody in the Process object.	2020-01-10 23:14:04 +01:00
Andreas Kling	944fbf507a	Kernel: Custody::absolute_path() should always return "/" for roots A Custody with no parent is always a root (although not necessarily the real root.)	2020-01-10 23:09:58 +01:00
Andreas Kling	b1ffde6199	Kernel: unlink() should not follow symlinks	2020-01-10 14:07:36 +01:00
Andreas Kling	7380c8ec6e	TmpFS: Synthesize "." and ".." in traverse_as_directory() As Sergey pointed out, it's silly to have proper entries for . and .. in TmpFS when we can just synthesize them on the fly. Note that we have to tolerate removal of . and .. via remove_child() to keep VFS::rmdir() happy.	2020-01-10 13:16:55 +01:00
Andreas Kling	59bfbed2e2	ProcFS: Don't expose kernel-only regions to users via /proc/PID/vm The superuser is still allowed to see them, but kernel-only VM regions are now excluded from /proc/PID/vm.	2020-01-10 10:57:33 +01:00
Andreas Kling	d310cf3b49	Kernel: Opening a file with O_TRUNC should update mtime	2020-01-08 15:21:06 +01:00
Andreas Kling	e485667201	Kernel: ftruncate() should update mtime	2020-01-08 15:21:06 +01:00
Andreas Kling	fe1bf067b8	ProcFS: Reads past the end of a generated file should be zero-length	2020-01-08 12:59:06 +01:00
Andreas Kling	28ee5b0e98	TmpFS: Reads past the end of a file should be zero-length	2020-01-08 12:47:41 +01:00
Andreas Kling	faf32153f6	Kernel: Take const Process& in InodeMetadata::may_{read,write,execute}	2020-01-07 19:24:06 +01:00
Andreas Kling	5387a19268	Kernel: Make Process::file_description() vend a RefPtr<FileDescription> This encourages callers to strongly reference file descriptions while working with them. This fixes a use-after-free issue where one thread would close() an open fd while another thread was blocked on it becoming readable. Test: Kernel/uaf-close-while-blocked-in-read.cpp	2020-01-07 15:53:42 +01:00
Andreas Kling	a49d9c774f	TmpFS: Add ASSERT(offset >= 0) to read_bytes() and write_bytes()	2020-01-07 15:25:56 +01:00
Andreas Kling	bb9db9d430	TmpFS: Add "." and ".." entries to all directories It was so weird not seeing them in "ls -la" output :^)	2020-01-07 14:48:43 +01:00
Andreas Kling	56a2c21e0c	Kernel: Don't leak kmalloc pointers through FIFO absolute paths Instead of using the FIFO's memory address as part of its absolute path identity, just use an incrementing FIFO index instead. Note that this is not used for anything other than debugging (it helps you identify which file descriptors refer to the same FIFO by looking at /proc/PID/fds).	2020-01-07 10:29:47 +01:00
Andreas Kling	9eef39d68a	Kernel: Start implementing x86 SMAP support Supervisor Mode Access Prevention (SMAP) is an x86 CPU feature that prevents the kernel from accessing userspace memory. With SMAP enabled, trying to read/write a userspace memory address while in the kernel will now generate a page fault. Since it's sometimes necessary to read/write userspace memory, there are two new instructions that quickly switch the protection on/off: STAC (disables protection) and CLAC (enables protection.) These are exposed in kernel code via the stac() and clac() helpers. There's also a SmapDisabler RAII object that can be used to ensure that you don't forget to re-enable protection before returning to userspace code. This patch also adds copy_to_user(), copy_from_user() and memset_user() which are the "correct" way of doing things. These functions allow us to briefly disable protection for a specific purpose, and then turn it back on immediately after it's done. Going forward all kernel code should be moved to using these and all uses of SmapDisabler are to be considered FIXME's. Note that we're not realizing the full potential of this feature since I've used SmapDisabler quite liberally in this initial bring-up patch.	2020-01-05 18:14:51 +01:00
Andreas Kling	12eb1f5d74	Kernel: Entries in /dev/pts should be accessible only to the owner This fixes an issue where anyone could snoop on any pseudoterminal.	2020-01-04 12:46:48 +01:00
Andreas Kling	b5da0b78eb	Kernel: File::open() should apply r/w mode from the provided options This has been a FIXME for a long time. We now apply the provided read/write permissions to the constructed FileDescription when opening a File object via File::open().	2020-01-04 12:30:55 +01:00
Andreas Kling	e79c33eabb	Kernel: The root inode of a TmpFS should have the sticky bit set We were running without the sticky bit and mode 777, which meant that the /tmp directory was world-writable without protection. With this fixed, it's no longer possible for everyone to steal root's files in /tmp.	2020-01-04 11:33:36 +01:00
Andreas Kling	d84299c7be	Kernel: Allow fchmod() and fchown() on pre-bind() local sockets In order to ensure a specific owner and mode when the local socket filesystem endpoint is instantiated, we need to be able to call fchmod() and fchown() on a socket fd between socket() and bind(). This is because until we call bind(), there is no filesystem inode for the socket yet.	2020-01-03 20:14:56 +01:00
Andreas Kling	4abbedb6e4	Kernel: Allow passing initial UID and GID when creating new inodes If we're creating something that should have a different owner than the current process's UID/GID, we need to plumb that all the way through VFS down to the FS functions.	2020-01-03 20:13:21 +01:00
Andreas Kling	82760998a9	Ext2FS: Take the inode lock in Ext2FSInode::metadata() Remove an unnecessary InterruptDisabler to make this not assert. :^)	2020-01-03 17:48:02 +01:00
Andreas Kling	889ecd1375	Kernel: The superuser is allowed to utime() on any file Before this patch, root was not able to "touch" someone else's file.	2020-01-03 04:14:41 +01:00
Andreas Kling	3f74e66e82	Kernel: rename() should fail with EXDEV for cross-device requests POSIX does not support rename() from one file system to another.	2020-01-03 04:10:05 +01:00
Andreas Kling	3be1c7b514	Kernel: Fix awkward bug where "touch /foo/bar/baz" could create "/baz" To accommodate file creation, path resolution optionally returns the last valid parent directory seen while traversing the path. Clients will then interpret "ENOENT, but I have a parent for you" as meaning that the file doesn't exist, but its immediate parent directory does. The client then goes ahead and creates a new file. In the case of "/foo/bar/baz" where there is no "/foo", it would fail with ENOENT and "/" as the last seen parent directory, causing e.g. the open() syscall to create "/baz". Covered by test_io.	2020-01-03 03:57:10 +01:00
Andreas Kling	0a1865ebc6	Kernel: read() and write() should fail with EBADF for wrong mode fd's It was previously possible to write to read-only file descriptors, and read from write-only file descriptors. All FileDescription objects now start out non-readable + non-writable, and whoever is creating them has to "manually" enable reading/writing by calling set_readable() and/or set_writable() on them.	2020-01-03 03:29:59 +01:00
Andreas Kling	064e46e581	Kernel: Don't allow open() with (O_CREAT \| O_DIRECTORY)	2020-01-03 03:16:29 +01:00
Andreas Kling	15f3abc849	Kernel: Handle O_DIRECTORY in VFS::open() instead of in each syscall Just taking care of some FIXMEs.	2020-01-03 03:16:29 +01:00
Andreas Kling	fdde5cdf26	Kernel: Don't include the process GID in the "extra GIDs" table Process::m_extra_gids is for supplementary GIDs only.	2020-01-02 23:45:52 +01:00
Andreas Kling	32ec1e5aed	Kernel: Mask kernel addresses in backtraces and profiles Addresses outside the userspace virtual range will now show up as 0xdeadc0de in backtraces and profiles generated by unprivileged users.	2020-01-02 20:51:31 +01:00
Andreas Kling	7f04334664	Kernel: Remove broken implementation of Unix SHM This code never worked, and was never used for anything. We can build a much better SHM implementation on top of TmpFS or similar when we get to the point when we need one.	2020-01-02 12:44:21 +01:00
Liav A	e5ffa960d7	Kernel: Create support for PCI ECAM The new PCI subsystem is initialized during runtime. PCI::Initializer is supposed to be called during early boot, to perform a few tests, and initialize the proper configuration space access mechanism. Kernel boot parameters can be specified by a user to determine what tests will occur, to aid debugging on problematic machines. After that, PCI::Initializer should be dismissed. PCI::IOAccess is a class that is derived from the PCI::Access class and implements the PCI configuration space access mechanism via x86 IO ports. PCI::MMIOAccess is a class that is derived from PCI::Access and implements the PCI configuration space access mechanism via memory access. The new PCI subsystem also supports determination of the IO/MMIO space needed by a device by checking a given BAR. In addition, every device or component that uses the PCI subsystem has been changed to match these changes.	2020-01-02 00:50:09 +01:00
Andreas Kling	d8ef13a426	ProcFS: Supervisor-only inodes should be owned by UID 0, GID 0	2019-12-31 13:22:43 +01:00
Andreas Kling	9af054af9e	ProcFS: Reduce the amount of info accessible to non-superusers This patch hardens /proc a bit by making many things only accessible to UID 0, and also disallowing access to /proc/PID/ for anyone other than the UID of that process (and superuser, obviously.)	2019-12-31 01:32:27 +01:00
Andreas Kling	54d182f553	Kernel: Remove some unnecessary leaking of kernel pointers into dmesg There's a lot more of this and we need to stop printing kernel pointers anywhere but the debug console.	2019-12-31 01:22:00 +01:00
Andreas Kling	50677bf806	Kernel: Refactor scheduler to use dynamic thread priorities Threads now have numeric priorities with a base priority in the 1-99 range. Whenever a runnable thread is not scheduled, its effective priority is incremented by 1. This is tracked in Thread::m_extra_priority. The effective priority of a thread is m_priority + m_extra_priority. When a runnable thread is scheduled, its m_extra_priority is reset to zero and the effective priority returns to base. This means that lower-priority threads will always eventually get scheduled to run, once their effective priority becomes high enough to exceed the base priority of threads "above" them. The previous values for ThreadPriority (Low, Normal and High) are now replaced as follows: Low -> 10, Normal -> 30, High -> 50. In other words, it will take 20 ticks for a "Low" priority thread to get to "Normal" effective priority, and another 20 to reach "High". This is not perfect, and I've used some quite naive data structures, but I think the mechanism will allow us to build various new and interesting optimizations, and we can figure out better data structures later on. :^)	2019-12-30 18:46:17 +01:00
Andreas Kling	c74cde918a	Kernel+SystemMonitor: Expose amount of per-process clean inode memory This is memory that's loaded from an inode (file) but not modified in memory, so still identical to what's on disk. This kind of memory can be freed and reloaded transparently from disk if needed.	2019-12-29 12:45:58 +01:00
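
The priority aging described in commit 50677bf806 can be illustrated with a small standalone sketch. This is not the kernel's code: `SketchThread` and `pick_next()` are hypothetical names, the tie-breaking rule (lowest index wins) is an assumption, and only the two fields named in the commit message (`m_priority`, `m_extra_priority`) are modeled.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the kernel's Thread class.
struct SketchThread {
    int base_priority;  // corresponds to m_priority (1-99) in the commit message
    int extra_priority; // corresponds to m_extra_priority
    int effective_priority() const { return base_priority + extra_priority; }
};

// Picks the runnable thread with the highest effective priority.
// The chosen thread's boost is reset to zero; every thread that was
// passed over gets its boost incremented by 1.
// Returns the index of the chosen thread, or -1 if the list is empty.
// Assumption: on a tie, the thread with the lowest index wins.
inline int pick_next(std::vector<SketchThread>& runnable)
{
    if (runnable.empty())
        return -1;
    std::size_t best = 0;
    for (std::size_t i = 1; i < runnable.size(); ++i) {
        if (runnable[i].effective_priority() > runnable[best].effective_priority())
            best = i;
    }
    for (std::size_t i = 0; i < runnable.size(); ++i) {
        if (i == best)
            runnable[i].extra_priority = 0;
        else
            ++runnable[i].extra_priority;
    }
    return static_cast<int>(best);
}
```

With one "Normal" (30) and one "Low" (10) thread that are both always runnable, the "Low" thread reaches an effective priority of 30 after being passed over 20 times, matching the "20 ticks" figure in the commit message. When it actually runs after that depends on how ties are broken, which the commit message does not specify.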

321 commits
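
The overflow rule from commit 0c44a12247 (read() and write() should fail with EOVERFLOW if offset+size overflows) can be sketched as a standalone range check. The helper name `check_io_range()`, the use of `long long` for the offset, and the EINVAL result for negative offsets are assumptions for illustration. The kernel's actual check may look different.

```cpp
#include <cerrno>
#include <cstddef>
#include <limits>

// Hypothetical helper: returns 0 if [offset, offset + size) is representable
// as a signed 64-bit file offset, or a negative errno value otherwise.
inline int check_io_range(long long offset, std::size_t size)
{
    // Assumption: negative offsets are rejected before the overflow check.
    if (offset < 0)
        return -EINVAL;
    const unsigned long long max_offset
        = static_cast<unsigned long long>(std::numeric_limits<long long>::max());
    // Compare against the remaining headroom instead of computing
    // offset + size, which could itself overflow.
    if (size > max_offset - static_cast<unsigned long long>(offset))
        return -EOVERFLOW;
    return 0;
}
```

Comparing `size` against the remaining headroom avoids evaluating the sum at all, so the check itself cannot trigger signed overflow.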