dolphin

Commit Graph

Author	SHA1	Message	Date
Stenzek	517a977444	ShaderCache: Fix several issues in background shader compiling - In D3D, shaders could be compiled on the main thread, blocking startup. - Reduced the latency between a pipeline being requested and used in all backends in hybrid ubershader mode, when no shader stages were present. - Fixed a case where async compilation could cause the same UID to be appended multiple times to the UID cache. - Fix incorrect number of threads being used when immediately compile shaders was enabled.	2018-03-15 01:50:47 +10:00
Stenzek	51e4014a35	DriverDetails: Disable primitive restart on Vulkan with Mali driver	2018-03-14 02:56:24 +10:00
Stenzek	db810956ec	Vulkan: Provide a more accurate method of detecting drivers/vendors This is needed to differentiate between the open-source Mesa drivers and their binary counterparts for Intel and AMD.	2018-03-14 02:48:53 +10:00
Henrik Rydgård	6cbb716f60	SSE-optimize UninitializeXFBMemory Lowest hanging fruit I could find with a profiler. Not sure this stuff actually needs to be done, but assuming it is, why not do it quickly? 10x faster, goes from 1% CPU to 0.09%.	2018-03-13 11:55:28 +01:00
Stenzek	427aa188d4	ShaderCache: Use a version number for pipeline UID caches	2018-03-11 14:48:09 +10:00
Stenzek	e31cc1f679	ShaderCache: Implement background shader compilation This enables shaders to be compiled while the game is starting, instead of blocking startup. If a shader is needed before it is compiled, emulation will block.	2018-03-10 16:11:19 +10:00
Stenzek	9fa24700b6	VideoConfig: Collapse ubershader configuration fields to a single value	2018-03-10 15:56:45 +10:00
Stenzek	590307b94c	ShaderCache: Use memcmp for comparing pipeline UIDs As these are stored in a map, operator< will become a hot function when doing lookups, which happen every frame. std::tie generated a rather large function here with quite a few branches.	2018-03-10 15:56:42 +10:00
Stenzek	41296db083	Renderer: Remove now-redundant Set{Rasterization,Depth,Blending}State	2018-03-10 15:56:40 +10:00
Stenzek	f9c829c7f7	OGL: Re-implement async shader compiling	2018-03-10 15:56:34 +10:00
Stenzek	dec0c3bce8	Move shader caches to VideoCommon	2018-03-10 15:56:30 +10:00
Stenzek	4c24a69710	VideoCommon: Add support for Abstract Framebuffers	2018-03-02 20:20:48 +10:00
Stenzek	2a6d9e4713	AbstractTexture: Add support for depth textures/formats	2018-03-01 17:31:24 +10:00
Stenzek	6374a4c4a8	AbstractTexture: Support multisampled abstract texture	2018-03-01 17:31:24 +10:00
Stenzek	4316f5f56b	AbstractTexture: Add property/attribute accessor helpers	2018-03-01 17:31:24 +10:00
Stenzek	e125eaa237	VideoCommon: Drop references to AbstractRawTexture	2018-03-01 17:31:24 +10:00
Markus Wick	227db66e4f	OGL: Use glBufferData on Mali. tl;dr: This PR speeds up Dolphin on mobiles with the Mali GPU and ES 3.2 drivers by a factor of 10 by using the method with the biggest overhead. Please take care not to buy this shit! The ARM driver team seems to care very well about their customers. But bad luck, users and open source developers are not their customers. So even device-independent feature requests are just ignored for years: https://community.arm.com/graphics/f/discussions/4645/gl_ext_buffer_storage-support The bad point: they don't implement any of the other common ways to stream dynamic content in unextended GL either: - They just ignore the GL_MAP_UNSYNCHRONIZED_BIT flag - They don't support on-device buffer updates and just stall with glBufferSubData It seems like no benchmark is using any dynamic content - and like no customer cares about anything but benchmarks, or users... We have a flag to disable the glBufferSubData way, this PR adds the flag to also disable the unsynchronized mapping way. The second one is available since their ES 3.2 update, but slow as hell. So how to continue? The last remaining technical way to stream dynamic content at all is to alloc a new buffer per draw call with glBufferData. This is very gross, but still a factor 10 speedup compared to stalling the GPU. Small tests show that you can expect another 3-5 times speedup with EXT_buffer_data, so Mali would be on par with Adreno here. So if you have unfortunately bought such a device, please try to make noise on your vendor forums/support and ask for this extension. If you are going to buy a new mobile, I'd recommend avoiding any mobile with a Mali GPU in it.	2018-02-25 17:12:36 +01:00
Stenzek	fec6bb4d56	VideoBackends: Add AbstractShader and AbstractPipeline classes	2018-02-22 22:02:34 +10:00
Stenzek	de632fc9c8	Renderer: Handle resize events on-demand instead of polling We now differentiate between a resize event and surface change/destroyed event, reducing the overhead for resizes in the Vulkan backend. It is also now safe to change the surface multiple times if the video thread is lagging behind.	2018-02-20 01:15:55 +10:00
Stenzek	c1b39ecc58	BPFunctions: Move upscaling of scissor rect to VideoCommon	2018-02-20 00:49:32 +10:00
Stenzek	5359396099	BPFunctions: Move GX viewport conversion to VideoCommon	2018-02-20 00:49:32 +10:00
Stenzek	340ee8fff8	PixelShaderGen: Implement table-based fog range as in software renderer	2018-02-15 22:19:21 +10:00
Stenzek	93fb0e1e1c	TextureCache: Add an option to disable EFB copies to VRAM The option is named DisableCopyToVRAM under the Hacks section in GFX.ini. It is intentionally not exposed to the GUI, as users should not need to use it under normal circumstances. The main use is debugging issues in the EFB-to-RAM shaders.	2018-02-11 15:48:46 +10:00
Stenzek	84b990faa0	VideoConfig: Remove bForceCopyToRam field It's the inverse of supports-copy-to-vram.	2018-02-11 15:29:37 +10:00
Anthony	096131c908	Merge pull request #6334 from stenzek/startup Video Backend Initialization/Core Boot Improvements	2018-02-07 23:35:54 -08:00
Stenzek	260d5b7aa7	BPMemory: Handle fog configuration where both A and C are infinity/NaN The console appears to behave against standard IEEE754 specification here, in particular around how NaNs are handled. NaNs appear to have no effect on the result, and are treated the same as positive or negative infinity, based on the sign bit. However, when the result would be NaN (inf - inf, or (-inf) - (-inf)), this results in a completely fogged color, or unfogged color respectively. We handle this by returning a constant zero for the A variable, and positive or negative infinity for C depending on the sign bits of the A and C registers. This ensures that no NaN value is passed to the GPU in the first place, and that the result of the fog calculation cannot be NaN.	2018-02-01 17:40:39 +10:00
Stenzek	fe5150cc31	Merge pull request #6303 from TraceBullet/auto-adjust-window-size Fix Auto-Adjust Window Size option making the window too large	2018-01-29 17:28:44 +10:00
Stenzek	c790077c13	VideoBackend: Remove PeekMessages method The video thread and backend no longer create any windows, therefore there will never be any messages dispatched to their thread.	2018-01-27 13:53:55 +10:00
Stenzek	d96e8c9d76	VideoBackends: Combine Initialize/Prepare and Cleanup/Shutdown methods Also allows the work previously done in Prepare to return a failure status.	2018-01-27 13:53:55 +10:00
TraceBullet	ab6f932347	Fix Auto-Adjust Window Size option making the window too large	2018-01-26 10:47:19 -05:00
Stenzek	3f197480ef	Renderer: Fix crash on shutdown when frame dumping or taking screenshots	2018-01-26 12:12:00 +10:00
Stenzek	81ae88d2d5	AbstractTexture: Fix crash in Vulkan backend when freeing texture	2018-01-26 19:12:11 +10:00
Stenzek	38e0b6e2ab	AbstractTexture: Move Bind() method to Renderer This makes state tracking simpler, and enables easier porting to command lists later on.	2018-01-22 13:22:09 +10:00
degasus	a5a0599145	CustomTexture: Drop old texture format.	2018-01-20 17:08:47 +01:00
degasus	0b466249e0	CustomTextures: Drop format conversion.	2018-01-20 16:39:04 +01:00
JosJuice	2441fd28d5	AVIDump: Remove incorrect usage of s_ prefix	2018-01-17 22:19:14 +01:00
Markus Wick	e02025b45e	Merge pull request #6307 from rukai/fix-frame-dump-path Handle framedump path not existing	2018-01-17 22:02:51 +01:00
Lucas Kent	6c7e6016fb	Handle framedump path not existing	2018-01-18 07:53:30 +11:00
Markus Wick	cb7eede193	VideoCommon: Apply custom texture scale for arbitrary mipmaps. We want to get the same mipmap level. And if the IR and the custom texture upscaling fits, we don't need to modify the LOD bias.	2018-01-17 09:02:36 +01:00
Markus Wick	2a43f41ace	Merge pull request #6297 from JosJuice/custom-texture-arb-filename Treat custom textures with "_arb" suffix as having arbitrary mipmaps	2018-01-15 09:58:30 +01:00
Markus Wick	b93ae14272	Merge pull request #6300 from JonnyH/WIP/glsl-es-implicit-int-float-conversions-in-gpu-texture-decode GLSL-ES doesn't allow implicit int/uint conversions	2018-01-11 22:22:05 +01:00
Jonathan Hamilton	46254a2cf2	Some more implicit uint/float conversions in the texture decode shaders	2018-01-11 11:15:40 -08:00
Jonathan Hamilton	f23dd992dd	GLSL-ES doesn't allow implicit int/uint conversions	2018-01-11 10:54:55 -08:00
JosJuice	226b65bd38	Clean up variable naming in HiresTextures::Update	2018-01-10 17:53:51 +01:00
JosJuice	c25fffc9a0	Treat custom textures with "_arb" suffix as having arbitrary mipmaps This is adapted from Bighead's code that was posted at https://forums.dolphin-emu.org/Thread-dolphin-custom-texture-mipmaps?pid=460867#pid460867 In master, custom textures are never treated as having arbitrary mipmaps, so we need either a change like this or a change that makes us apply the arbitrary mipmap heuristic even when a custom texture is used.	2018-01-10 17:51:45 +01:00
Markus Wick	22f469697b	Merge pull request #6290 from JosJuice/invalid-aspect-ratio Treat invalid aspect ratio setting values as Auto	2018-01-08 13:46:30 +01:00
JosJuice	1557e6ab05	Specify underlying types for enums that get casted from integers Otherwise we might get UB if the value we cast is larger than the max value of the underlying type that the compiler picked for the enum. I haven't done any extensive check through Dolphin to find cases of this, I'm just fixing the cases I already know of.	2018-01-08 12:14:18 +01:00
JosJuice	a2404c42a1	Treat invalid aspect ratio setting values as Auto	2018-01-06 12:53:53 +01:00
Markus Wick	56d153f548	VideoCommon: Apply the yscale as upscaling of the XFB.	2018-01-06 10:36:33 +01:00
Jonathan Hamilton	ceb1f8c8cb	Enable shader_framebuffer_fetch blend path on ubershaders Tested on a linux Intel Skylake integrated graphics with blend_func_extended force-disabled, as it's the only platform I have that doesn't crash with ubershaders and supports fb_fetch	2018-01-05 09:56:46 -08:00
Jonathan Hamilton	8d68adcaf3	Workaround qualcomm driver bug It seems it doesn't like modifying inout variables in place - so instead use a temporary for ocol0/ocol1 and only write them once at the end of the shader	2018-01-05 09:56:46 -08:00
Jonathan Hamilton	29a9ed043b	Implement dual-source blending in shader For some GLES drivers that don't support dual-source blending, but do support GL_EXT_shader_framebuffer_fetch, this might be useful.	2018-01-05 09:56:46 -08:00
Vlad Firoiu	330881ae80	Allow users to specify the encoder used for framedumping.	2018-01-03 13:23:10 +01:00
Sepalani	44935c23e0	MAX_XFB_HEIGHT: PAL value off by two fixed	2017-12-30 20:22:10 +01:00
iwubcode	50d96a4411	BPStructs: Use static cast for XFB/EFB copies	2017-12-21 21:19:26 -06:00
iwubcode	e1332b1d7e	Texture Cache Base: Move PAL vertical scaling to be baked into the texture size. This saves on a number of multiplications and fixes an issue where we were multiplying the y-scaling factor by the srcRectangle's height + 1 which was causing a crash in some GC titles	2017-12-21 21:19:26 -06:00
JosJuice	a3355a3e4a	Only use the "Scaled EFB Copy" setting for EFB, not XFB The hybrid XFB PR made it apply to both EFB copies and XFB copies.	2017-12-19 12:59:36 +01:00
Emmanuel Gil Peyrot	d9c2de90c0	Reimplement EFB copy for intensity formats This is the direct continuation of #5887, and removes the last usage of the colmat matrix in EFB copy conversion shaders.	2017-12-19 12:04:56 +01:00
Markus Wick	0bf24f549a	Merge pull request #6190 from JosJuice/is-trivially-copyable-vs Remove IsTriviallyCopyable hack for VS	2017-12-19 11:15:58 +01:00
Markus Wick	5afd07c033	Merge pull request #6254 from iwubcode/frame_counter_and_advance_fix Video Common: frame logic fix	2017-12-19 09:44:49 +01:00
JosJuice	4973ae9952	Rename GFX_HACK_COPY_EFB_ENABLED to GFX_HACK_COPY_EFB_SCALED Not sure why it was named like this... It doesn't affect whether the copy happens or not, only what resolution it uses.	2017-12-18 21:55:02 +01:00
iwubcode	2f7eacd08c	Video Common: move frame advance and counter logic to only happen when we actually render a frame.	2017-12-15 12:46:18 -06:00
Jonathan Hamilton	a38103afbf	Fix GLES GPU texture conversion shader GLES doesn't support C-style array initialisers, so stuff like: Type var[2] = { VALUE_A, VALUE_B }; isn't supported in GLES (it was added in GL_ARB_shading_language_420pack). The texture conversion shader used this, so would fail to compile on GLES.	2017-12-12 22:01:10 +00:00
Markus Wick	a70c03a1c0	Merge pull request #6249 from stenzek/texture-dump Texture dumping regression fixes	2017-12-11 08:44:09 +01:00
Jonathan Hamilton	e401004766	Fix an invalid (uint < int) comparison in GLES with ubershaders	2017-12-10 10:52:32 -08:00
Stenzek	2579f8eaa5	TextureCache: Don't dump custom textures They don't have a basename, therefore were being saved to empty files, and potentially can be compressed.	2017-12-10 15:31:46 +10:00
Leo Lam	ecf30cfc74	Merge pull request #6240 from JosJuice/stream-path-encoding Don't use wrong encoding for paths when opening streams on Windows	2017-12-07 10:04:55 +01:00
degasus	02dd062518	VideoCommon: Drop now unused efb2tex matrix generation.	2017-12-06 09:30:03 +01:00
JosJuice	9d8a82e1d9	Don't use wrong encoding for paths when opening streams on Windows	2017-12-05 21:23:35 +01:00
Stenzek	d0601c0a83	TextureConversionShader: Use round() instead of roundEven() in HLSL HLSL does not define roundEven(), only round(). This means that the output may differ slightly for OpenGL vs Direct3D. However, it ensures consistency across OpenGL drivers, as round() in GLSL can go either way.	2017-12-06 03:33:50 +10:00
Stenzek	beb35320f6	TextureConverterShaderGen: Make generated shaders HLSL-compatible	2017-12-06 03:33:50 +10:00
Markus Wick	8bcc392d5d	Merge pull request #5887 from linkmauve/alpha-fix Simplify EFB copy shaders	2017-12-03 11:21:08 +01:00
Léo Lam	67ba326ff6	VideoCommon: Fix -Wformat warnings	2017-12-02 21:48:01 +01:00
Emmanuel Gil Peyrot	85bb1d7bb0	VideoCommon: Reimplement texture format conversion This fixes the rendering of the scan visor in Metroid Prime 2: Echoes, as seen in https://fifoci.dolphin-emu.org/dff/mp2-scanner/ The alpha channel was off-by-one on Ivy Bridge due to the rounding after multiplication with colmat. This commit removes this matrix altogether in most cases, making them simple GLSL swizzles.	2017-12-02 19:29:01 +01:00
Markus Wick	76fbdbb06e	Merge pull request #5903 from linkmauve/fix-mp2e Round values in swizzlers	2017-12-02 17:38:21 +01:00
degasus	e2a9ee384f	VideoCommon: Rename TextureConversionShader namespace.	2017-12-02 15:17:39 +01:00
degasus	e8febd0cef	VideoCommon: Create a namespace for TextureConversionShaderGen.	2017-12-02 15:17:39 +01:00
Markus Wick	491c10ec96	VideoBackends: Use VideoCommon shader generators for efb2tex copies. This will generate one shader per copy format. For now, it is the same shader with the colmat hard coded. So it should already improve the GPU performance a bit, but a rewrite of the shader generator is suggested. Half of the patch is done by linkmauve1: VideoCommon: Reorganise the shader writes.	2017-12-02 15:17:39 +01:00
Stenzek	cd68b3606c	Merge pull request #6193 from stenzek/readbacks Abstract Staging Textures - VideoCommon interface for texture readbacks/uploads	2017-12-01 14:24:06 +10:00
Markus Wick	96e094e127	Merge pull request #6216 from stenzek/headless-frame-dumping OGL: Fix headless frame dumping	2017-11-26 11:36:10 +01:00
Jules Blok	7dd207931d	UberShaderPixel: Fix the interpolation qualifier for interface blocks. Fixes MSAA on MacOS with ubershaders enabled.	2017-11-25 21:51:14 +01:00
Stenzek	32125cf181	OGL: Fix headless frame dumping Also skips swapping the window system buffers in headless mode, as there may not be a surface which can be swapped in the first place. Instead, we call glFlush() at the end of a frame in this case.	2017-11-23 16:53:55 +10:00
Stenzek	db1d9de933	AbstractTexture: Drop slow map readback path	2017-11-22 18:49:33 +10:00
Stenzek	193763ca3a	TextureCacheBase: Don't crop last row/column of XFB copies Unsure why this was happening in the first place.	2017-11-22 18:49:33 +10:00
Stenzek	6577365851	VideoCommon: Re-implement asynchronous frame dumping This was lost as a result of hybrid XFB, now it is back, and ~10% faster in very brief testing.	2017-11-22 18:49:33 +10:00
Stenzek	c2cc128f1b	AbstractTexture: Implement Save using new common methods	2017-11-22 18:47:04 +10:00
Stenzek	56afebeb44	AbstractTexture: Separate CopyRectangleFromTexture to two methods ScaleRectangleFromTexture, which does a draw, and CopyRectangleFromTexture, which, where possible, does a bit-for-bit copy.	2017-11-22 18:47:04 +10:00
Stenzek	f43d85921d	VideoBackends: Add AbstractStagingTexture class Can be used for asynchronous readback or upload of textures.	2017-11-22 18:47:04 +10:00
Stenzek	a584ccc7d8	AbstractTexture: Support BGRA8 formats Used for some driver's swap chains, and EFB to RAM.	2017-11-22 18:47:04 +10:00
Stenzek	49a9c33bd7	VideoCommon: Move abstract texture creation function to Renderer	2017-11-22 18:47:04 +10:00
Leo Lam	73a67aa413	Merge pull request #6204 from stenzek/downscaled-screenshots Frame Dumping: Fix window-size framedumping	2017-11-22 07:53:23 +01:00
Stenzek	efb9759862	LightingShaderGen: Always calculate lighting for both color channels Cel-damage uses the color from the lighting stage of the vertex pipeline as texture coordinates, but sets numColorChans to zero. We now calculate the colors in all cases, but override the color before writing it from the vertex shader if numColorChans is set to a lower value.	2017-11-22 01:52:18 +10:00
Stenzek	8985cb2f0b	Renderer: Re-implement window size frame dumping This was broken by hybrid XFB, and all frame dumping/screenshots occurred at the full internal resolution.	2017-11-21 17:19:43 +10:00
Stenzek	39559f6358	VideoConfig: Remove bSupportsInternalResolutionFrameDumps Field is unused as of Hybrid XFB.	2017-11-21 17:19:43 +10:00
Stenzek	18c1bf19ca	VideoCommon: Use constant for number of color channels in XFMemory	2017-11-20 23:18:44 +10:00
Leo Lam	80bcc0d58d	Merge pull request #6186 from lioncash/enum-class VideoConfig: Make AspectMode and StereoMode enum classes	2017-11-19 15:08:16 +01:00
JosJuice	2ead31c769	Merge pull request #6203 from lioncash/missing-braces-warn VideoCommon: Resolve -Wmissing-brace warnings	2017-11-19 08:45:13 +01:00
JosJuice	2385a03c2b	Merge pull request #6201 from lioncash/headers VideoBackends: Remove header inclusions made unnecessary with Hybrid XFB	2017-11-19 08:44:48 +01:00
Lioncash	518f6a3624	VideoCommon: Resolve -Wmissing-brace warnings Resolves around 5 -Wmissing-brace warnings on macOS.	2017-11-19 01:49:39 -05:00
Lioncash	2bd88bdb9f	AbstractTexture: Remove a redundant constructor initializer list entry This is already initialized in the class definition. This would previously cause a -Wreorder warning on macOS, as m_config is defined after m_currently_mapped.	2017-11-19 01:25:55 -05:00


3614 Commits