dolphin

Commit Graph

Author	SHA1	Message	Date
Techjar	b267f54259	VideoCommon: Only include centered pixels in bounding box At higher resolutions, our bounding box dimensions end up being slightly larger than original hardware in some cases. This is not necessarily wrong, it's just an artifact of rendering at a higher resolution, due to bringing out detail that wouldn't have appeared on original hardware. It causes a texel to fall partially on what would have been a single pixel at native resolution, resulting in the coordinates getting bumped up to the next valid value. In many cases, these slightly larger bounding boxes are perfectly fine, as games don't hard-code expected dimensions. It is problematic in Paper Mario TTYD though, for a somewhat complicated reason. Paper Mario TTYD frequently uses EFB copies to pre-render a bunch of animation frames for a character sprite (especially in Chapter 2), so that it can then render 100 or more of them without bringing the GameCube to its knees. Based on my observation, the game seems to set aside a region of memory to store these EFB copies. This region is obviously fairly small, as the GameCube only has 24MB of RAM. There are 2 rooms in Chapter 2 where you fight a horde of as many as 100 Jabbies, which are also rendered using EFB copies, so in this room the game ends up making 130(!) EFB copies just for Puni and Jabbi sprites. This seems to nearly fill the region of memory it set aside for them. Unfortunately, our slightly larger bounding boxes at higher resolutions result in overflowing this memory, causing very strange behavior. Some EFB copies partially overlap game state, resulting in reading it as a garbage RGB5A3 texture that constantly changes. Others apparently somehow trigger a corner case in our persistent buffer mapping, causing them to partially overwrite earlier EFB copies.
What this change does is only include the screen coordinates that align with the equivalent native resolution pixel centers, which generally results in the bounding boxes being more in line with original hardware. It isn't perfect, but it's enough to fix Paper Mario TTYD's Jabbi rooms by avoiding the buffer overflow. Notably, it is more accurate at odd resolutions than at even resolutions. Native resolution is completely unaffected by this change, as should be the case. This change may also have a small positive impact on shader performance at higher resolutions, as there will be fewer atomic operations performed.	2021-06-10 21:49:52 -04:00
Pokechu22	820d9ffbfa	Remove PixelShaderGen hasindstage	2021-06-09 20:50:49 -07:00
Techjar	4866002c9b	VideoCommon: Perform OpenGL bounding box inversion in pixel shader Running the min/max operation on the upside down, quad-rounded pixel coordinates before inverting them to the standard upper-left origin produces wrong results. Therefore, we need to do the inversion before rounding to pixel quads.	2021-06-06 20:55:06 -04:00
Techjar	83d55704aa	VideoCommon: Round bounding box coordinates down and remove pixel center offset Fragment coordinates always have a 0.5 offset from a whole integer, as that's where the pixel center is on modern GPUs. Therefore, we want to always round the fragment coordinates down for bounding box calculations. This also renders the pixel center offset useless, as 0.5 vs ~0.5833333 makes no difference when rounding down.	2021-06-05 00:34:10 -04:00
Pokechu22	c58837964f	VideoCommon: Fix bounding box on AMD/OpenGL/Windows Co-authored-by: Techjar <tecknojar@gmail.com>	2021-05-31 16:22:50 -07:00
Techjar	e4aef0a85b	VideoCommon: Move bounding box pixel quads rounding to shader This avoids rounding values that the game writes to the bounding box registers, especially the default values.	2021-05-28 23:30:22 -04:00
Pokechu22	5928182a4c	Skip indirect operation for out of bounds indirect stages This fixes rendering issues in Viewtiful Joe (https://bugs.dolphin-emu.org/issues/12525), but it is not entirely hardware accurate, as hardware testing showed other, more complex behavior in this case. However, it should be good enough for our purposes.	2021-05-27 22:13:42 -07:00
Pokechu22	b5844ab195	PixelShaderGen: always run indirect stage logic Hardware testing has confirmed that fb_addprev and wrapping both run even when the indirect stage is disabled.	2021-05-07 16:37:47 -07:00
Tillmann Karras	f6cf85a8bc	PixelShaderGen: Fix OOB tex coord indices Previously we set the texture coordinate to zero, now we set the texture coordinate index to zero. This fixes the ripple effect of the Mario painting in Luigi's Mansion. Co-authored-by: Pokechu22 <Pokechu022@gmail.com>	2021-05-07 16:28:09 -07:00
Pokechu22	002ff4e4dd	PixelShaderGen: Remove unused num_texgens argument It became unused in `f039149198`.	2021-05-07 16:28:08 -07:00
Pokechu22	c3668e179c	Split TevStageIndirect::mid into matrix_index and matrix_id	2021-05-07 16:27:52 -07:00
sspacelynx	aba9cae5ab	DriverDetails: Fix broken vector bitwise AND on Mali drivers	2021-04-11 15:15:02 +02:00
Pokechu22	70f9fc4e75	Convert BPMemory to BitField and enum class Additional changes: - For TevStageCombiner's ColorCombiner and AlphaCombiner, op/comparison and scale/compare_mode have been split as there are different meanings and enums if bias is set to compare. (Shift has also been renamed to scale) - In TexMode0, min_filter has been split into min_mip and min_filter. - In TexImage1, image_type is now cache_manually_managed. - The unused bit in GenMode is now exposed. - LPSize's lineaspect is now named adjust_for_aspect_ratio.	2021-03-06 19:27:19 -08:00
Pokechu22	aab81d5aa0	Convert XFMemory to BitField and enum class Additionally a new ClipDisable union has been added (though it is not currently used by Dolphin).	2021-03-06 19:27:14 -08:00
Stenzek	51724c1ccd	LightingShaderGen: Always calculate lighting for both color channels Cel-damage depends on lighting being calculated for the first channel even though there is no color in the vertex format (defaults to the material color). If lighting for the channel is not enabled, the vertex will use the default color as before. The default value of the color is determined by the number of elements in the vertex format. This fixes the grey cubes in Super Mario Sunshine. If the color channel count is zero, we set the color to black before the end of the vertex shader. It's possible that this would be undefined behavior on hardware if a vertex color index that was greater than the channel count was used within TEV.	2020-11-20 15:54:04 -08:00
Lioncash	3d9b2aa005	VideoCommon: Migrate over to fmt Migrates off the printf-based formatting where applicable.	2020-11-17 21:23:58 -05:00
Lioncash	a5b28f1f07	ShaderGenCommon: Rename WriteFmt() to Write() Now that we've converted all of the shader generators over to using fmt, we can drop the old Write() member function and perform a rename operation on the WriteFmt() to turn it into the new Write() function. All changes within this are the removal of a <cstdarg> header, since the previous printf-based Write() required it, and renaming. No functional changes are made at all.	2020-11-09 02:31:49 -05:00
Lioncash	dc72edf0e2	UberShaderPixel: Migrate over to fmt Completes the migration over to using the fmt-formatting WriteFmt function. The next PR will rename all usages of WriteFmt, while simultaneously getting rid of the old printf code.	2020-10-27 13:25:11 -04:00
Lioncash	5de2244acf	PixelShaderGen: Migrate over to fmt Continues the migration of the shader generators over to fmt. After this, all that remains is the Ubershader pixel generator.	2020-10-23 17:17:55 -04:00
Lioncash	9bd533ebe4	VideoCommon/BoundingBox: Make interface for querying bounding box data Rather than expose the bounding box members directly, we can instead provide an interface for code to use. This makes it nicer to transition from global data, as the interface function names are already in place.	2019-12-05 11:48:42 -05:00
Connor McLaughlin	6b4ca318c9	Merge pull request #8139 from lioncash/const VideoCommon/PixelShaderGen: Make look-up table arrays immutable where applicable	2019-06-08 20:03:02 +10:00
Lioncash	149a97e396	VideoCommon: Remove unnecessary memset on ShaderUid instances. Zero-initialization zeroes out all members and padding bits, so this is safe to do. While we're at it, also add static assertions that enforce the necessary requirements of a UID type explicitly within the ShaderUid class. This way, we can remove several memset calls around the shader generation code that makes sure the underlying UID data is zeroed out. Now our ShaderUid class enforces this for us, so we don't need to care about it at the usage sites.	2019-05-30 06:41:54 -04:00
Lioncash	8537f8e095	VideoCommon/PixelShaderGen: Fix up wonky comment formatting This is likely a remnant of the initial transition to clang-format.	2019-05-29 22:03:02 -04:00
Lioncash	45934dd0c5	VideoCommon/PixelShaderGen: Make arrays immutable Many of the arrays defined within this file weren't declared as immutable, which can inhibit the strings being put into the read-only segment. We can declare them constexpr to make them immutable. While we're at it, we can use std::array, to allow conditional bounds checking with standard libraries. The declarations can also be shortened in the future when all platform toolchain versions we use support std::array deduction guides. Currently macOS and FreeBSD builders fail on them.	2019-05-29 22:00:58 -04:00
Lioncash	e28d08e973	VideoCommon/PixelShaderGen: Brace control structures that travel more than one line Them's the code formatting rules.	2019-05-29 20:16:41 -04:00
weihuoya	e98f43d2af	bbox minor fx	2019-05-09 17:30:17 +08:00
Stenzek	96aa762d1e	PixelShaderGen: Don't emit bounding box shader code for old UIDs when disabled If bounding box is enabled when a UID cache is created, then later disabled, we shouldn't emit the bounding box portion of the shader. Fixes pipeline creation errors on D3D12 backend for this case.	2019-04-20 23:16:14 +10:00
Techjar	a318c55cec	PixelShaderGen: Fix bounding box coordinates being offset by 1 pixel	2019-04-03 10:09:05 -04:00
Stenzek	d66d778bae	PixelShaderGen: Use subgroup reduction operations for bounding box	2019-03-29 20:06:56 +10:00
Stenzek	16294acd2a	VideoBackends: Scale bounding box rectangle in the pixel shader	2019-03-25 18:47:58 +10:00
Stenzek	f039149198	Move most backend functionality to VideoCommon	2019-02-19 16:57:54 +10:00
Stenzek	1d61041985	ShaderGen: Don't use interface blocks on Vulkan without GS Doing so causes the Adreno driver to choke and spew errors about too many output locations/components, when clearly we're under the limit.	2019-01-24 17:02:17 +10:00
Stenzek	68cb24172b	ShaderGen: Omit some unused varyings when possible Removes the clipPos varying unless slow-depth is used, and the clipDistance varyings if geometry shaders are not used.	2019-01-23 18:34:22 +10:00
Stenzek	0c0d66809d	PixelShaderGen: Split bbox into separate variables The Metal shader compiler fails to compile the atomic instructions when operating on individual components of a vector. Splitting it into four variables shouldn't make any difference for other platforms, as they are accessed independently.	2018-11-07 05:41:09 -08:00
Tillmann Karras	56fdcf5f00	VideoCommon: remove unnecessary floor() floatindex is clamped to the range [0, 9]. For non-negative numbers floor() is equivalent to trunc(). Truncation happens implicitly when converting to uint, so the floor() is unnecessary.	2018-10-09 00:31:43 +01:00
Pierre Bourdon	95c2a92f26	Revert "ShaderGen: Drop broken fragment shader index workaround for Vulkan"	2018-09-01 05:32:56 +02:00
Stenzek	3ad7812b53	ShaderGen: Drop broken fragment shader index workaround for Vulkan AMD appears to have since fixed this in their driver, and it makes shadergen ever so slightly less messy.	2018-08-28 23:39:47 +10:00
Stenzek	bae1168fe9	PixelShaderGen: Ensure all components of ocol1 are initialized This was causing a warning in the shader compiler, as the rgb components were not initialized. Which shouldn't be an issue, as the rgb is not used in the blend equation, only the alpha. However, the lack of initialization causes crashes in Intel's D3D shader compiler, so we'll play nice and initialize all the channels.	2018-08-24 23:53:00 +10:00
Stenzek	57976c947b	ShaderGen: Don't emit integer outputs when logic op is unsupported This may have been causing issues for D3D10 hardware, where logic op was not supported.	2018-05-26 00:09:29 +10:00
Stenzek	9a5c2119e5	ShaderCache: Remove unused UID bits before inserting into shader map	2018-05-26 00:09:10 +10:00
spycrab	40bb9974f2	Reformat all the things!	2018-04-12 21:28:39 +02:00
Lioncash	f10198500e	PixelShaderGen: Invert conditional in WriteColor() Puts the true condition body first instead of the false one for better reading.	2018-03-27 18:53:17 -04:00
Lioncash	c9e0045881	PixelShaderGen: Remove dead code in WriteColor() The outer conditional executes only whenever destination alpha is used.	2018-03-27 18:46:25 -04:00
Lioncash	a52cc8d52b	PixelShaderGen/UberShaderPixel: Silence -Wmissing-braces warnings	2018-03-23 10:06:27 -04:00
Lioncash	50a476c371	Assert: Uppercase assertion macros Macros should be all upper-cased. This is also kind of a wart that's been sticking out for quite a while now (we avoid prefixing underscores).	2018-03-14 22:03:12 -04:00
Stenzek	340ee8fff8	PixelShaderGen: Implement table-based fog range as in software renderer	2018-02-15 22:19:21 +10:00
Jonathan Hamilton	ceb1f8c8cb	Enable shader_framebuffer_fetch blend path on ubershaders Tested on a linux Intel Skylake integrated graphics with blend_func_extended force-disabled, as it's the only platform I have that doesn't crash with ubershaders and supports fb_fetch	2018-01-05 09:56:46 -08:00
Jonathan Hamilton	8d68adcaf3	Workaround qualcomm driver bug It seems it doesn't like modifying inout variables in place - so instead use a temporary for ocol0/ocol1 and only write them once at the end of the shader	2018-01-05 09:56:46 -08:00
Jonathan Hamilton	29a9ed043b	Implement dual-source blending in shader For some GLES drivers that don't support dual-source blending, but do support GL_EXT_shader_framebuffer_fetch, this might be useful.	2018-01-05 09:56:46 -08:00
Stenzek	efb9759862	LightingShaderGen: Always calculate lighting for both color channels Cel-damage uses the color from the lighting stage of the vertex pipeline as texture coordinates, but sets numColorChans to zero. We now calculate the colors in all cases, but override the color before writing it from the vertex shader if numColorChans is set to a lower value.	2017-11-22 01:52:18 +10:00
Stenzek	b573319f8f	D3D: Fix shader compile error with logicop and alpha test enabled	2017-09-07 11:48:01 +10:00
Stenzek	84f8ebd95f	VideoBackends: Clear uid bits that are unused for the current backend Currently, this is only the logic op bit, but this will be extended to the framebuffer fetch/blend modes. In the future, when/if we move to VideoCommon pipelines, this state will be part of the pipeline UID anyway, and we can mask it out in the backend by using a two-level map, so the shaders/programs are shared.	2017-09-05 23:49:42 +10:00
Stenzek	0622979d3b	ShaderGen: Support writing integer colors when logic op is enabled This is required for D3D to support logic op.	2017-09-03 16:33:25 +10:00
Stenzek	852e30bb8c	Ubershaders: Fix 6-bit color truncation not being applied	2017-08-01 00:01:09 +10:00
Stenzek	d54ca2a16a	PixelShaderGen: Don't include host early z support in UID Disable it at shader generation time instead. This way the UIDs are sharable.	2017-07-30 17:43:59 +10:00
Stenzek	e968c191ff	Ubershaders: Support per-pixel lighting	2017-07-30 17:43:59 +10:00
Stenzek	e17efb1d8d	ShaderGen: Use consistent variable names for texture coordinates	2017-07-30 17:43:59 +10:00
Stenzek	81b4ed2a81	Vulkan: Uber shader support	2017-07-30 17:43:59 +10:00
Stenzek	7d78cf0f6f	ShaderGen: Implement pixel ubershaders	2017-07-30 17:43:59 +10:00
Stenzek	07591e7d5c	PixelShaderGen: Move header generation to separate function	2017-07-30 17:43:59 +10:00
Scott Mansell	479abde9f4	BPMemory: Convert a number of unions to BitFields	2017-07-30 17:43:59 +10:00
Stenzek	3ea9d86faa	ShaderGen: Pass host config to shader generation functions Also moves the host config checks to common.	2017-07-20 17:54:33 +10:00
Stenzek	82c27182a8	ShaderGen: Remove host state from shader uids	2017-07-20 17:46:59 +10:00
Jules Blok	8476d0271d	PixelShaderGen: Add missing inputs. This fixes a linkage warning in D3D because of mismatched inputs.	2017-07-05 22:43:39 +02:00
Jonathan Hamilton	35caf48d7d	Add a workaround for a PowerVR GLSL compiler bug It seems it doesn't like negating arguments for bitops, so stuff like "var <<= (-othervar);" fails to compile	2017-05-03 19:33:56 -07:00
Michael Maltese	134317e07f	videoconfig: add BBoxPreferStencilImplementation @JMC47 requested this to be able to do performance comparisons.	2017-03-15 17:41:33 -07:00
Michael Maltese	de940a5fd6	VideoConfig: add bSupportsFragmentStoresAndAtomics	2017-03-15 17:20:47 -07:00
Tillmann Karras	8a72e84ece	VideoCommon: remove unnecessary trunc()	2017-03-05 01:53:54 +00:00
Lioncash	273ace7bb7	LightingShaderGen: Remove unnecessary includes	2017-02-01 01:06:00 -05:00
degasus	41b0c74e30	VideoCommon: Make dst_alpha state implicit.	2017-01-04 20:02:31 +01:00
Stenzek	5aaa5141ed	PixelShaderGen: Fix incorrect indexing of tevLerpBias Fixes the white box in the fortune street fifolog.	2016-12-13 00:03:25 +10:00
Jules Blok	2058a49465	PixelShaderGen: Remove unused global variable.	2016-10-31 15:02:08 +01:00
Jules Blok	086f839435	DriverDetails: Make the bug identifiers humanly readable.	2016-10-31 15:02:08 +01:00
Jules Blok	d0e60492cf	PixelShaderGen: Don't use a global variable.	2016-10-30 00:48:02 +02:00
Jules Blok	ce9f717045	OGL: Fall back to the old dual-source blending behaviour.	2016-10-29 18:00:22 +02:00
Jules Blok	122ce96ed1	PixelShaderGen: Don't use dual source blend if unsupported.	2016-10-24 00:11:55 +02:00
Jules Blok	5c406b5ef0	PixelShaderGen: Fix implicit type conversions.	2016-10-23 19:54:28 +02:00
Jules Blok	93109df654	PixelShaderGen: Use dual source blending to do 8-bit blending.	2016-10-10 15:34:55 +02:00
Jules Blok	d61ae5a88a	PixelShaderGen: Only set dither uid bit if actually used.	2016-10-10 14:32:21 +02:00
Jules Blok	365e88e311	VideoConfig: Add an option to force RGB8 EFB format.	2016-10-10 14:31:53 +02:00
Jules Blok	31a4ee7bfc	PixelShaderGen: Always truncate the alpha channel to 6-bit.	2016-10-10 14:30:14 +02:00
Jules Blok	9bc466a443	PixelShaderGen: Add support for dithering.	2016-10-10 14:30:14 +02:00
Jules Blok	b958457e51	PixelShaderGen: Add support for RGBA6 EFB format truncation.	2016-10-10 14:27:30 +02:00
degasus	6d2827ff6b	PixelShader: Drop redundant warning. This warning was raised both on UID generation and on shader generation.	2016-10-04 10:13:46 +02:00
degasus	083be0832f	LightingShader: Drop xfmem usage. Our shaders must only depend on the UID, not on any global state.	2016-10-04 10:13:46 +02:00
degasus	829fc8f0ad	PixelShaderGen: Drop dstAlphaMode constant in shader generation. It is already stored within the UID.	2016-10-04 10:13:46 +02:00
Stenzek	bac8c2d441	Vulkan: Work around indexed fragment output bug on AMD drivers	2016-10-01 02:40:02 +10:00
Stenzek	a71381e80a	VideoCommon: Add APIType entry for Vulkan	2016-10-01 01:09:11 +10:00
Stenzek	d9c034e8cc	ShaderGen: Specify attribute/output locations/bindings explicitly This also shifts the SSBO index from index 3 to index 0.	2016-10-01 01:09:11 +10:00
Jules Blok	da43f9a156	PixelShaderGen: Always divide the texture coordinates by q. Some games will set q to a different value than 1.0 through texture matrix manipulations. It seems the console will still do the division in that case.	2016-09-07 21:13:16 +02:00
Lioncash	14e0b48ae4	VideoCommon: Make API_TYPE an enum class Allows for forward declarations in most places, which prevents dumping unrelated VideoCommon.h contents directly into headers.	2016-07-29 19:20:16 -04:00
Scott Mansell	d958388617	VideoCommon: Fix some warnings.	2016-06-26 23:10:31 +12:00
Scott Mansell	24e5d21780	Multithreaded Shadergen: Second pass over Pixel Shadergen. Note: It's not 100% perfect, as some of the GPU capabilities leak into the pixel shader UID. Currently our UIDs don't get exported, so there is no issue. But someone might want to fix this in the future.	2016-06-26 16:13:21 +12:00
Scott Mansell	1a831cfc7d	Multithreaded Shadergen: Second Pass over vertex/lighting Shadergens As much as possible, the asserts have been moved out of the GetUID function. But there are some places where asserts depend on variables that aren't stored in the shader UID.	2016-06-26 16:13:21 +12:00
Scott Mansell	4969415f38	Remove global references from common code. Bug Fix: Previously vertex shaders and geometry shaders didn't track antialiasing state in their UIDs, which could cause AA bugs on DirectX.	2016-06-26 16:13:20 +12:00
Scott Mansell	0d996f512b	Multithreaded Shadergen: First pass over pixel Shadergen Bug Fix: It was theoretically possible for a shader with depth writes disabled to map to the same UID as a shader with late depth writes. No known test cases trigger this.	2016-06-26 16:13:20 +12:00
Scott Mansell	e99364c7c9	UID Change: Fix bug with indirect stage UIDs Bug Fix: The normal stage UIDs were randomly overwriting indirect stage texture map UID fields. It was possible for multiple shaders with different indirect texture targets to map to the same UID. Once again, it doesn't look like this bug was ever triggered.	2016-06-26 16:13:19 +12:00
Scott Mansell	03f2c9648d	Shader UID change: Only store the two bits of components we need. This frees up 21 bits and allows us to shorten the UID struct by an entire 32 bits. It's not strictly needed (as it's encoded into the length) but I added a bit for per-pixel lighting to make my life easier in the following commits.	2016-06-26 16:13:19 +12:00
Scott Mansell	53c402dbc5	Multithreaded Shadergen: First Pass over vertex/lighting Shadergens The only code which touches xfmem is code which writes directly into uid_data. All the rest now read their parameters out of uid_data. I also simplified the lighting code so it always generated separate codepaths for alpha and color channels instead of trying to combine them on the off-chance that the same equation works for all 4 channels. As modern (post 2008) GPUs generally don't calculate all 4 channels in a single vector, this optimisation is pointless. The shader compiler will undo it during the GLSL/HLSL to IR step. Bug Fix: The above optimisation was also broken, applying the color light equation to the alpha light channel instead of the alpha light equation. But it doesn't look like anything triggered this bug.	2016-06-26 16:13:19 +12:00
Pierre Bourdon	5fcb4bb3ab	Further fixes to the formatting change. WX sucks.	2016-06-24 12:16:10 +02:00

324 Commits
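
The bounding box commits near the top of this log (b267f54259 and 83d55704aa) describe two ideas: fragment coordinates sit at a 0.5 offset, so they are rounded down, and at higher internal resolutions only the scaled pixels that line up with a native-resolution pixel center are counted. The following is a minimal C++ sketch of that logic under stated assumptions. It is not Dolphin's shader code: `BBox`, `PixelIndex`, `IsCenteredPixel`, `UpdateBBox`, the integer `scale` parameter, and the inclusive coordinate convention are all hypothetical names and choices made for illustration.

```cpp
#include <algorithm>
#include <cmath>

// Hypothetical sketch; names and layout are assumptions, not Dolphin's shader code.
struct BBox
{
  int left, right, top, bottom;  // inclusive, in native-resolution pixels
};

// Fragment centers sit at i + 0.5, so rounding down recovers the pixel index i.
inline int PixelIndex(float frag_coord)
{
  return static_cast<int>(std::floor(frag_coord));
}

// True when a scaled pixel index is the one treated as covering the native
// pixel center. With an odd scale this is the exact middle pixel; with an
// even scale there is no exact middle, so this is only an approximation.
// Assumes scaled_index >= 0 and scale >= 1.
inline bool IsCenteredPixel(int scaled_index, int scale)
{
  return scaled_index % scale == scale / 2;
}

// Grows the bounding box only for fragments that land on a centered pixel.
inline void UpdateBBox(BBox& bbox, float frag_x, float frag_y, int scale)
{
  const int sx = PixelIndex(frag_x);
  const int sy = PixelIndex(frag_y);
  if (!IsCenteredPixel(sx, scale) || !IsCenteredPixel(sy, scale))
    return;

  // Convert back to native-resolution coordinates before min/max.
  const int nx = sx / scale;
  const int ny = sy / scale;
  bbox.left = std::min(bbox.left, nx);
  bbox.right = std::max(bbox.right, nx);
  bbox.top = std::min(bbox.top, ny);
  bbox.bottom = std::max(bbox.bottom, ny);
}
```

With `scale == 1` every pixel passes the centered check, which matches the commit's note that native resolution is unaffected. Skipping non-centered fragments also means fewer min/max updates, which corresponds to the reduction in atomic operations mentioned in the message.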