0ct0pu5/ladybird

Author	SHA1	Message	Date
Stephan Unverwerth	bb28492af0	LibSoftGPU: Make output in PixelQuad generic Same as with inputs, we define outputs as a generic array of floats. This can later be expanded to accommodate multiple render targets or vertex attributes in the case of a vertex shader.	2022-12-17 22:39:09 -07:00
Stephan Unverwerth	c008b6ce18	LibSoftGPU: Make input in PixelQuad generic Previously we would store vertex color and texture coordinates in separate fields in PixelQuad. To make them accessible from shaders we need to store them as a completely generic array of floats.	2022-12-17 22:39:09 -07:00
cflip	abc0c44f0b	LibGL+LibGPU+LibSoftGPU: Report maximum texture size	2022-10-19 22:07:05 +02:00
Jelle Raaijmakers	6dcc808994	LibSoftGPU: Reduce subpixel precision from 6 to 4 bits With 6 bits of precision, the maximum triangle coordinate we can handle is sqrt(2^31 / (1 << 6)^2) = ~724. Rendering to a target of 800x600 or higher quickly becomes a mess because of integer overflow. By reducing the subpixel precision to 4 bits, we support coordinates up to ~2896, which means that we can (try to) render to target sizes like 2560x1440. This fixes the main menu backdrop for the Half-Life port. It also introduces more white pixel artifacts in Quake's water / lava rendering, but this is a level geometry visualization bug (see `r_novis`).	2022-09-13 20:20:03 +02:00
Jelle Raaijmakers	1d36bfdac1	LibGL+LibSoftGPU: Implement fixed pipeline support for `GL_COMBINE` `GL_COMBINE` is basically a fixed function calculator to perform simple arithmetic on configurable fragment sources. This patch implements a number of texture env parameters with support for the RGBA internal format.	2022-09-11 22:37:07 +01:00
RKBethke	0836912a6d	LibGL+LibGPU+LibSoftGPU: Implement and expose glClipPlane This commit implements glClipPlane and its supporting calls, backed by new support for user-defined clip planes in the software GPU clipper. This fixes some visual bugs seen in the Quake III port, in which mirrors would only reflect correctly from close distances.	2022-05-11 23:09:47 +02:00
Jelle Raaijmakers	526390ec06	LibSoftGPU: Move back to `i32`-based subpixels Our move to floating point precision has eradicated the pixel artifacts in Quake 1, but introduced new and not so subtle rendering glitches in games like Tux Racer. This commit changes three things to get the best of both worlds: 1. Subpixel logic based on `i32` types was reintroduced, the number of bits is set to 6. This reintroduces the artifacts in Quake 1 but fixes rendering of Tux Racer. 2. Before triangle culling, subpixel coordinates are calculated and stored in `Triangle`. These coordinates are rounded, which fixes the Quake 1 artifacts. Tux Racer is unaffected. 3. The triangle area (actually parallelogram area) is also stored in `Triangle` so we don't need to recalculate it later on. In our previous subpixel code, there was a subtle disconnect between the two calculations (one with and one without subpixel precision) which resulted in triangles incorrectly being culled. This fixes some remaining Quake 1 artifacts.	2022-05-05 20:50:46 +02:00
Stephan Unverwerth	5d2740217f	LibGL+LibGPU+LibSoftGPU: Move Vertex.h to LibGPU	2022-04-06 11:32:24 +02:00
Stephan Unverwerth	e416380826	LibGL+LibGPU+LibSoftGPU: Move StencilConfiguration.h to LibGPU	2022-04-06 11:32:24 +02:00
Stephan Unverwerth	24d420312c	LibGL+LibGPU+LibSoftGPU: Move Enums.h to LibGPU	2022-04-06 11:32:24 +02:00
Jelle Raaijmakers	37dd10fbbe	LibSoftGPU: Use `float` instead of `int` for triangle screen coords This replaces the fixed point subpixel precision logic. GLQuake now effectively renders artifact-free. Previously white/gray pixels would sometimes be visible at triangle edges, caused by slightly misaligned triangle edges as a result of converting the vertex window coordinates to `int`. These artifacts were reduced by the introduction of subpixel precision, but not completely eliminated. Some interesting changes in this commit: * Applying the top-left rule for our counter-clockwise vertices is now done with simpler conditions: every vertex that has a Y coordinate lower than or equal to the previous vertex' Y coordinate is counted as a top or left edge. A float epsilon is used to emulate a switch between `> 0` and `>= 0` comparisons. * Fog depth calculation into a `f32x4` is now done once per triangle instead of once per fragment, and only if fog is enabled. * The `one_over_area` value was previously calculated as `1.0f / area`, where `area` was an `int`. This resulted in a lower quality reciprocal value whereas we can now retain floating point precision. The effect of this can be seen in Tux Racer, where the ice reflection is noticeably smoother.	2022-03-07 11:00:45 +01:00
Lenny Maiorani	065525aba0	LibSoftGPU: Configure stats overlay period Problem: - The statistics overlay period is hardcoded to 500 ms. This time is very short and can result in the values being very "jumpy". Solution: - Increasing this value can result in more steady values which is useful when trying to evaluate the performance impact of a change. A new config value is offered in `Config.h` to let the developer change to any value desired.	2022-01-22 08:57:31 +03:30
Stephan Unverwerth	a5040ecdfc	LibSoftGPU: Reduce number of samplers to 2 OpenGL mandates at least 2 texture units when multitexturing is supported. This keeps our vertices lean and gives a nice speed improvement in glquake. Until we support shaders this should be enough.	2022-01-19 19:57:49 +01:00
Jesse Buhagiar	192befa84b	LibGL+LibSoftGPU: Add `GL_MAX_LIGHTS` to get_context_parameter This is required to allow lighting to work properly in the GL. We currently have the maximum number of lights in the software GL context set to 8, as this is the minimum that OpenGL mandates according to the spec.	2022-01-12 13:36:56 +01:00
Stephan Unverwerth	57215d0e1f	LibSoftGPU: Allow arbitrary render target sizes With the RASTERIZER_BLOCK_SIZE gone we can now render to any size, even odd ones. We have to be careful to not generate out of bounds accesses when calculating the render target and depth buffer pointers. Thus we check the coverage mask and generate nullptrs for pixels that will not be updated. This also masks out pixels that would touch the triangle but are outside the render target/scissor rect bounds.	2022-01-09 16:21:13 +03:30
Stephan Unverwerth	8ae3eb6c33	LibSoftGPU: Implement 5 bits of subpixel precision This snaps vertices to 1/32 of a pixel before rasterization resulting in smoother movement and less floaty appearance of moving triangles. This also reduces the severity of the artifacts in the glquake port. 5 bits should allow up to 1024x1024 render targets. Anything larger needs a different implementation.	2022-01-06 17:55:05 +01:00
Stephan Unverwerth	b7c0c32f24	LibSoftGPU: Add option to render a debug overlay This displays statistics regarding frame timings and number of pixels rendered. Timings are based on the time between draw_debug_overlay() invocations. This measures actual number of frames presented to the user vs. wall clock time so this also includes everything the app might do besides rendering. Triangles are counted after clipping. This number might actually be higher than the number of triangles coming from LibGL. Pixels are counted after the initial scissor and coverage test. Pixels rejected here are not counted. Shaded pixels is the percentage of all pixels that made it to the shading stage. Blended pixels is the percentage of shaded pixels that were alpha blended to the color buffer. Overdraw measures how many pixels were shaded vs. how many pixels the render target has. e.g. a 640x480 render target has 307200 pixels. If exactly that many pixels are shaded the overdraw number will read 0%. 614400 shaded pixels will read as an overdraw of 100%. Sampler calls is simply the number of times sampler.sample_2d() was called.	2022-01-01 15:09:21 +01:00
Stephan Unverwerth	fe36edf6ae	LibSoftGPU: Put all constexpr config options into Config.h	2022-01-01 15:09:21 +01:00

18 commits
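
Two of the commit messages above quote arithmetic that is easy to check by hand: the coordinate limit for a given subpixel precision (6dcc808994) and the overdraw percentage shown in the debug overlay (b7c0c32f24). The following is a minimal sketch of both formulas as stated in those messages. It is not taken from LibSoftGPU; the function names `max_coordinate` and `overdraw_percent` are hypothetical and exist only for this illustration.

```python
import math


def max_coordinate(subpixel_bits: int, int_bits: int = 31) -> float:
    """Largest coordinate before the product overflows a signed 32-bit int.

    Follows the formula quoted in commit 6dcc808994:
    sqrt(2^31 / (1 << bits)^2). The name and signature are assumptions.
    """
    scale = 1 << subpixel_bits
    return math.sqrt(2**int_bits / scale**2)


def overdraw_percent(shaded_pixels: int, width: int, height: int) -> float:
    """Overdraw as described in commit b7c0c32f24.

    Shading exactly as many pixels as the render target has reads 0%;
    shading twice as many reads 100%. The name is an assumption.
    """
    target_pixels = width * height
    return (shaded_pixels / target_pixels - 1.0) * 100.0


if __name__ == "__main__":
    for bits in (6, 5, 4):
        print(f"{bits} bits: max coordinate ~{math.floor(max_coordinate(bits))}")
    print(f"overdraw: {overdraw_percent(614400, 640, 480)}%")
```

For 6 and 4 bits this gives roughly 724 and 2896, matching the figures in the commit message. For 5 bits it gives roughly 1448, which is above the 1024x1024 target size mentioned in 8ae3eb6c33.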