dolphin

Commit Graph

Author	SHA1	Message	Date
skidau	a5674bbe84	Merge pull request #475 from kayru/d3d_state_cache D3D: Implemented cache for dynamic render states	2014-10-18 13:11:39 +11:00
comex	4134a0ad54	Make some variables static (should probably adjust for coding style too, but I'm not the one who merged code with bad style...)	2014-10-16 17:03:37 -04:00
comex	1ff86a4716	Merge pull request #1295 from crudelios/remove-bbox-settings Remove setting to enable or disable Bounding Box calculation.	2014-10-16 17:00:37 -04:00
degasus	8f403696ea	DriverDetails: mark intel buffer_storage bug as fixed	2014-10-16 22:51:32 +02:00
skidau	3023abc1b5	Merge pull request #1285 from degasus/master PixelShaderGen: replace multiplication with shift	2014-10-16 14:04:25 +11:00
Yuriy O'Donnell	2e4667caaa	D3D: Moved render state cache into separate source files. Refactored StateCache::Get() to early out for narrower indentation. Added comments to clarify ownership of objects returned by StateCache::Get().	2014-10-15 20:22:39 +02:00
crudelios	d281b4d7e1	Remove setting to enable or disable Bounding Box calculation.	2014-10-15 19:02:54 +01:00
skidau	8ef21bc5e2	Merge pull request #1272 from RisingFog/sconfig-dump-frames Move bDumpFrames to SConfig (and its references)	2014-10-15 13:42:37 +11:00
Markus Wick	1227bd2ba6	PixelShaderGen: replace multiplication with shift iirc both nvidia and i965 doesn't optimize this	2014-10-14 12:34:37 +02:00
skidau	9551650c42	Merge pull request #1095 from crudelios/sw-bbox Reimplement Bounding Box calculation using the software renderer.	2014-10-13 15:57:11 +11:00
Fog	8d424b114a	Move bDumpFrames to SConfig (and its references)	2014-10-12 23:56:16 -04:00
Fog	cd0c784d5a	Changed Dump Frames References	2014-10-12 19:51:13 -04:00
skidau	18c81dbc33	Merge pull request #1261 from lioncash/mesa-resonance-cascade AVIDump: Add missing CoreTiming header	2014-10-12 14:26:23 +11:00
Lioncash	e283839b68	AVIDump: Add missing CoreTiming header Fixes build on the mesa buildbot	2014-10-11 23:12:19 -04:00
skidau	a00ad6871c	Merge pull request #1220 from RisingFog/avsync Proper Audio/Video Dumping	2014-10-12 14:04:45 +11:00
crudelios	1e3b9ecdc1	Fix compile errors after rebase.	2014-10-10 12:44:44 +01:00
crudelios	176ea06e82	Get buildbot to compile.	2014-10-10 12:28:15 +01:00
crudelios	47c67f014f	Fix linux build and various warnings. Increase savestate version.	2014-10-10 12:28:13 +01:00
crudelios	2d4b7e3f3f	Reimplement Bounding Box calculation using the software renderer.	2014-10-10 12:27:06 +01:00
Fog	fc4125cdd1	Proper Audio/Video Dumping	2014-10-09 00:06:04 -04:00
Jules Blok	39f421d45d	Support the borderless fullscreen option in all backends.	2014-10-07 16:48:43 +02:00
Jules Blok	7344f752b7	Replace BorderlessFullscreenEnabled by ExclusiveFullscreenEnabled. Special handling was associated with this function, which only applies to exclusive fullscreen.	2014-10-07 16:43:32 +02:00
Lioncash	16a74a9557	Fifo: Fix tab/space mismatches	2014-10-06 20:04:57 -04:00
Lioncash	7c05d029d3	Merge pull request #1085 from waddlesplash/refactoring Migrate global init stuff into UICommon.	2014-10-05 21:25:44 -04:00
Augustin Cavalier	19109e2d01	Migrate global init stuff into UICommon. This avoids code duplication in a bunch of places. I also moved the NVIDIA Optimus export into VideoCommon.	2014-10-05 20:47:37 -04:00
comex	7f6284c2fc	Change a bunch of reference function arguments to pointers. Per the coding style and sanity.	2014-10-02 03:00:33 -04:00
Rohit Nirmal	ce8a4f5cc5	VideoCommon: Silence -Wmaybe-uninitialized warnings.	2014-09-30 16:14:18 -04:00
Tony Wasserka	13fc8e7df1	Merge pull request #578 from RachelBryk/IR Cleanup Renderer::CalculateTargetSize(), and allow IRs higher than 4x to be set via INI.	2014-09-30 19:21:21 +02:00
comex	2eebdff01b	Remove useless STACKALIGN macro. It only ever did anything on 32-bit OS X. Anyway, it wasn't even on the right functions, and these days ABI_PushRegistersAndAdjustStack should handle maintaining the ABI correctly.	2014-09-30 01:42:47 -04:00
comex	87a95727cd	ReadDataFromFifo is always called with len = 32. Remove the parameter to enable optimizations. And rename some variables around it to be less confusing.	2014-09-29 22:07:16 -04:00
skidau	007ba13cfa	Merge pull request #1144 from skidau/fifo-linked Moved the linking of the FIFO CPWritePointer near where CPWritePointer gets updated	2014-09-29 13:52:33 +10:00
comex	6c0a68d507	Add the override config option. I hate the config code, but now is not the time to fix it...	2014-09-28 21:34:31 -04:00
comex	3a2048ea57	Add a central variable g_want_determinism which controls whether to try to make things deterministic. It now affects the GPU determinism mode as well as some miscellaneous things that were calling IsNetPlayRunning. Probably incomplete. Notably, this can change while paused, if the user starts recording a movie. The movie code appears to have been missing locking between setting g_playMode and doing other things, which probably had a small chance of causing crashes or even desynced movies; fix that with PauseAndLock. The next commit will add a hidden config variable to override GPU determinism mode.	2014-09-28 21:34:31 -04:00
comex	65af90669b	Add the 'desynced GPU thread' mode. It's a relatively big commit (less big with -w), but it's hard to test any of this separately... The basic problem is that in netplay or movies, the state of the CPU must be deterministic, including when the game receives notification that the GPU has processed FIFO data. Dual core mode notifies the game whenever the GPU thread actually gets around to doing the work, so it isn't deterministic. Single core mode is because it notifies the game 'instantly' (after processing the data synchronously), but it's too slow for many systems and games. My old dc-netplay branch worked as follows: everything worked as normal except the state of the CP registers was a lie, and the CPU thread only delivered results when idle detection triggered (waiting for the GPU if they weren't ready at that point). Usually, a game is idle iff all the work for the frame has been done, except for a small amount of work depending on the GPU result, so neither the CPU nor the GPU waiting on the other affected performance much. However, it's possible that the game could be waiting for some earlier interrupt, and any of several games which, for whatever reason, never went into a detectable idle (even when I tried to improve the detection) would never receive results at all. (The current method should have better compatibility, but it also has slightly higher overhead and breaks some other things, so I want to reimplement this, hopefully with less impact on the code, in the future.) With this commit, the basic idea is that the CPU thread acts as if the work has been done instantly, like single core mode, but actually hands it off asynchronously to the GPU thread (after backing up some data that the game might change in memory before it's actually done). Since the work isn't done, any feedback from the GPU to the CPU, such as real XFB/EFB copies (virtual are OK), EFB pokes, performance queries, etc. is broken; but most games work with these options disabled, and there is no need to try to detect what the CPU thread is doing. Technically: when the flag g_use_deterministic_gpu_thread (currently stuck on) is on, the CPU thread calls RunGpu like in single core mode. This function synchronously copies the data from the FIFO to the internal video buffer and updates the CP registers, interrupts, etc. However, instead of the regular ReadDataFromFifo followed by running the opcode decoder, it runs ReadDataFromFifoOnCPU -> OpcodeDecoder_Preprocess, which relatively quickly scans through the FIFO data, detects SetFinish calls etc., which are immediately fired, and saves certain associated data from memory (e.g. display lists) in AuxBuffers (a parallel stream to the main FIFO, which is a bit slow at the moment), before handing the data off to the GPU thread to actually render. That makes up the bulk of this commit. In various circumstances, including the aforementioned EFB pokes and performance queries as well as swap requests (i.e. the end of a frame - we don't want the CPU potentially pumping out frames too quickly and the GPU falling behind), SyncGPU is called to wait for actual completion. The overhead mainly comes from OpcodeDecoder_Preprocess (which is, again, synchronous), as well as the actual copying. Currently, display lists and such are escrowed from main memory even though they usually won't change over the course of a frame, and textures are not even though they might, resulting in a small chance of graphical glitches. When the texture locking (i.e. fault on write) code lands, I can make this all correct and maybe a little faster. This suggests an alternate determinism method of just delaying results until a short time before the end of each frame. For all I know this might mostly work - I haven't tried it - but if any significant work hinges on the completion of render to texture etc., the frame will be missed.	2014-09-28 21:34:29 -04:00
comex	2d4b7c5900	Make ReadDataFromFifo static.	2014-09-28 21:25:12 -04:00
comex	0ae9e398c8	Rejigger some FIFO buffer variables to be more rational. videoBuffer -> s_video_buffer size -> s_video_buffer_write_ptr g_pVideoData -> g_video_buffer_read_ptr (impl moved to Fifo.cpp) This eradicates the wonderful use of 'size' as a global name, and makes it clear that s_video_buffer_write_ptr and g_video_buffer_read_ptr are the two ends of the FIFO buffer s_video_buffer. Oh, and remove a useless namespace {}.	2014-09-28 21:25:12 -04:00
comex	e86ddacb18	Changes to allow LoadCPReg to work in a preprocess mode which affects a separate state. This state will be used to calculate sizes for skipping over commands on a separate thread. An alternative to having these state variables would be to have the preprocessor stash "state as we go" somewhere, but I think that would be much uglier. GetVertexSize now takes an extra argument to determine which state to use, as does FifoCommandRunnable, which calls it. While I'm modifying FifoCommandRunnable, I also change it to take a buffer and size as parameters rather than using g_pVideoData, which will also be necessary later. I also get rid of an unused overload.	2014-09-28 21:25:06 -04:00
comex	f0131c2e09	Mechanical changes to move most CP state to a struct rather than separate globals. The next commit will add a separate copy of the struct and the ability for LoadCPReg to work on it.	2014-09-28 21:23:29 -04:00
comex	90638c6806	Switch to an unordered_map as a micro-optimization.	2014-09-28 21:23:29 -04:00
comex	f8452ff501	Fix threading issue with vertex loader JIT. VertexLoader::VertexLoader was setting loop_counter, a static variable, to 0. This was nonsensical, but harmless until I started to run it on a separate thread, where it had a chance of interfering with a running vertex translator. Switch to just using a register for the loop counter.	2014-09-28 21:23:28 -04:00
comex	63c62b277d	Some changes to VertexLoaderManager: - Lazily create the native vertex format (which involves GL calls) from RunVertices rather than RefreshLoader itself, freeing the latter to be run from the CPU thread (hopefully). - In order to avoid useless allocations while doing so, store the native format inside the VertexLoader rather than using a cache entry. - Wrap the s_vertex_loader_map in a lock, for similar reasons.	2014-09-28 21:23:28 -04:00
skidau	275226c2b6	Merge pull request #1147 from RachelBryk/unicode-tex Allow custom textures to load from unicode paths.	2014-09-28 14:54:56 +10:00
Rachel Bryk	4fe1119e52	Cleanup Renderer::CalculateTargetSize(), and allow IRs higher than 4x to be set via ini.	2014-09-25 19:50:25 -04:00
skidau	539f270c67	Added a xf.numtexgen != bp.numtextgen error log if there is a mismatch detected.	2014-09-24 10:46:09 +10:00
skidau	b4399dbdf3	Fixed the "Undeclared identifier: uv0" OpenGL shader compile error that appears in NBA2K11.	2014-09-24 00:10:45 +10:00
Rachel Bryk	4ed9b561bd	Allow custom textures to load from unicode paths.	2014-09-22 12:51:30 -04:00
skidau	8c5e12cf02	Moved the linking of the FIFO CPWritePointer near where CPWritePointer gets updated. The CPWritePointer was getting updated while it was in-flight causing Pac-man Party to flicker. Fixes issue 5223.	2014-09-22 16:49:09 +10:00
Tony Wasserka	1d23c2ca8b	GPU: Only load the relevant color components upon writes to the tev color registers. The other two components need not be valid upon write, hence loading them results in glitches. Fixes issue 6783.	2014-09-21 10:38:22 +02:00
skidau	536582b2eb	Merge pull request #1129 from lioncash/casing VideoCommon: Fix function casing in FrameBufferManagerBase	2014-09-21 15:56:17 +10:00
Lioncash	a6ffa55215	VideoCommon: Fix function casing in FrameBufferManagerBase	2014-09-20 14:54:59 -04:00


2146 Commits