dolphin

Commit Graph

Author	SHA1	Message	Date
Tony Wasserka	1d23c2ca8b	GPU: Only load the relevant color components upon writes to the tev color registers. The other two components need not be valid upon write, hence loading them results in glitches. Fixes issue 6783.	2014-09-21 10:38:22 +02:00
skidau	536582b2eb	Merge pull request #1129 from lioncash/casing VideoCommon: Fix function casing in FrameBufferManagerBase	2014-09-21 15:56:17 +10:00
Lioncash	a6ffa55215	VideoCommon: Fix function casing in FrameBufferManagerBase	2014-09-20 14:54:59 -04:00
Lioncash	91438fa9e7	VideoCommon: Make zfreeze in GenMode 1 bit in size	2014-09-20 14:30:41 -04:00
Ryan Houdek	eb23882398	Merge pull request #1120 from rohit-n/muh-precompiled-headers Fix build failing when disabling precompiled headers.	2014-09-19 17:43:42 -05:00
Rohit Nirmal	46057db37d	Fix build failing when disabling precompiled headers.	2014-09-19 18:17:51 -04:00
magumagu	32e5043b29	WIP XFB scaling. Still an ugly mess.	2014-09-19 12:33:15 -05:00
Lioncash	b06ec302d1	Remove some unnecessary semicolons	2014-09-11 13:05:31 -04:00
Ryan Houdek	71cb09f1ca	Merge pull request #1027 from rohit-n/change-include Include CommonTypes.h instead of Common.h.	2014-09-10 00:35:16 -05:00
skidau	d1439bc1db	Merge pull request #1041 from RachelBryk/kill-g_CoreStartupParameter Kill Core::g_CoreStartupParameter.	2014-09-10 11:00:42 +10:00
Rachel Bryk	f93aa7087c	Kill Core::g_CoreStartupParameter.	2014-09-09 00:24:49 -04:00
Fiora	94c20db369	Rename Log2 and add IsPow2 to MathUtils for future use Also remove unused pow2/pow2f functions.	2014-09-08 20:15:45 -07:00
Rohit Nirmal	fbc64984ca	Include CommonTypes.h instead of Common.h.	2014-09-08 15:39:58 -04:00
comex	c5c0b36046	Remove the inaccurately named ABI_PushAllCalleeSavedRegsAndAdjustStack (it didn't preserve FPRs!) and replace with ABI_PushRegistersAndAdjustStack. To avoid FPRs being pushed unnecessarily, I checked the uses: DSPEmitter doesn't use FPRs, and VertexLoader doesn't use anything but RAX, so I specified the register list accordingly. The regular JIT, however, does use FPRs, and as far as I can tell, it was incorrect not to save them in the outer routine. Since the dispatcher loop is only exited when pausing or stopping, this should have no noticeable performance impact.	2014-09-08 01:00:10 -04:00
comex	2dafbfb3ef	Improve code and clarify parameters to ABI_Push/PopRegistersAndAdjustStack. - Factor common work into a helper function. - Replace confusingly named "noProlog" with "rsp_alignment". Now that x86 is not supported, we can just specify it explicitly as 8 for clarity. - Add the option to include more frame size, which I'll need later. - Revert a change by magumagu in March which replaced MOVAPD with MOVUPD on account of 32-bit Windows, since it's no longer supported. True, apparently recent processors don't execute the former any faster if the pointer is, in fact, aligned, but there's no point using MOVUPD for something that's guaranteed to be aligned... (I discovered that GenFrsqrte and GenFres were incorrectly passing false to noProlog - they were, in fact, functions without prologs, the original meaning of the parameter - which caused the previous change to break. This is now fixed.)	2014-09-08 00:58:56 -04:00
shuffle2	227b79bf84	Merge pull request #1004 from comex/warning-fixes-2 Two trivial warning fixes	2014-09-06 11:56:32 -07:00
comex	917c6d324a	Remove unused functions in TextureDecoder.	2014-09-06 13:32:54 -04:00
Rohit Nirmal	629ceaf2b1	Split some parts of UpdateBoundingBox into multiple lines. Also, fix issues causing failure on Lint.	2014-09-06 09:49:27 -05:00
Rohit Nirmal	debe3999b5	Remove more dead and redundant code.	2014-09-05 23:22:48 -05:00
shuffle2	0576046fdd	Merge pull request #972 from Sonicadvance1/fix-intel-windows Work around Intel's failings with buffer_storage	2014-09-05 11:06:49 -07:00
Fiora	07e0c917c6	Revert "JIT64: optimize CA calculations"	2014-09-05 10:26:30 -07:00
comex	97420c6ec6	Merge pull request #852 from FioraAeterna/optimizeca JIT64: optimize CA calculations	2014-09-05 11:52:02 -04:00
shuffle2	a9a6270982	Merge pull request #774 from magcius/texdecode-cleanup Clean up the TextureDecoder and some related things	2014-09-04 19:37:49 -07:00
Jasper St. Pierre	76b4dbdf28	TextureDecoder: Clean up the code style For a long time, we've had ugly and inconsistent function names here as helpers, names like "decodebytesRGB5A3rgba" which are absolutely incomprehensible to understand. Fix this by introducing a new consistent naming scheme, where the above function now becomes "DecodeBytes_RGB5A3".	2014-09-04 18:36:57 -07:00
Jasper St. Pierre	0b7bed4a52	TextureDecoder: Simplify how the reference texture decoder works Instead of having three separate functions and checking the tlutfmt in a variety of places, just do it once in a helper method. This is already for the slow path either in our Generic decoder or in our Software renderer, so it doesn't matter that this is slower. x64 will continue using the separate functions for speed.	2014-09-04 18:36:57 -07:00
Jasper St. Pierre	ea1245d191	TextureDecoder: Pass the TLUT address straight into the texture decoder This removes the requirement for the TextureDecoder to have access to global texture memory.	2014-09-04 18:36:57 -07:00
Jasper St. Pierre	fcd4ecc942	TextureDecoder: Add an enum for the TLUT formats Quick code cleanup. The enum names and values come from libogc.	2014-09-04 18:36:56 -07:00
Jasper St. Pierre	32da01edec	TextureDecoder: Rearrange header slightly Put the two Decode APIs together.	2014-09-04 18:36:56 -07:00
Jasper St. Pierre	f975307016	TextureDecoder: Add some statics to some of our helper functions I know these are already inline, but this makes it more clear that they're helper functions to be used in this file only.	2014-09-04 18:36:56 -07:00
Jasper St. Pierre	a8e591dc73	VideoCommon: Remove support for decoding to ARGB textures The D3D / OGL backends only ever used RGBA textures, and the Software backend uses its own custom code for sampling. The ARGB path seems to just be dead code. Since ARGB and RGBA formats are similar, I don't think this will make the code more difficult to read or unable to be used as reference. Somebody who wants to use this code to output ARGB can simply modify the MakeRGBA function to put the shift at the other end.	2014-09-04 18:36:56 -07:00
Jasper St. Pierre	9438a30384	VideoCommon: Start putting common texture decoding code in TextureDecoder_Common This pulls all the duplicate code from TextureDecoder_Generic / TextureDecoder_x64 out and puts it in a common file. Out custom font used for debugging the texture cache is also pulled out and put in a common "sfont.inc" file. At some point we should also combine this font with the other six binary fonts we ship.	2014-09-04 18:36:53 -07:00
skidau	73fc45db68	Merge pull request #967 from skidau/SyncGPU-SaveState Added the EmuRunningState check to the GPU thread's FIFO loop	2014-09-05 11:24:46 +10:00
Jasper St. Pierre	bfb2c04ace	TextureDecoder: Remove unused function GetPC_TexFormat was never used. It was added in commit `d02426a`, with the only user being commented out code. The commented out code was later removed in `9893122`, but the implementation stayed.	2014-09-04 17:32:06 -07:00
Jasper St. Pierre	6682a2fadd	TextureDecoder: Fix a RGBA/BGRA copy/paste typo We were decoding to BGRA32 textures in our RGBA32 texture decoder. Since this is the same for the BGRA32 decoder implementation, this is most likely a copy/paste typo, rather than the texture actually being bit-swapped. Fix this. I'm not sure of any games that use the C14X2 texture format, so I'm not sure this fixes any games, but it does make the code cleaner for when we clean it up in the future, and merge some of these similar loops.	2014-09-04 17:30:53 -07:00
Jasper St. Pierre	a5297f6da8	PixelEngine: Remove unused AllowIdleSkipping and all references to it	2014-09-04 17:25:59 -07:00
Sonicadvance1	e32b2e1771	Work around Intel's failings with with buffer_storage	2014-09-04 19:03:49 -05:00
Yuriy O'Donnell	d8d9bc8c6c	Render: Implemented simple render target pool This avoids creating and destroying render targets every frame, which is a significant CPU overhead. Old render targets are destroyed after 3 frames.	2014-09-04 22:21:06 +02:00
Dolphin Bot	830a03c540	Merge pull request #957 from degasus/frame_skipping VideoCommon: rewrite frame skipping code	2014-09-04 18:27:19 +02:00
degasus	8b84ddce9a	VideoCommon: rewrite frame skipping code	2014-09-04 18:07:39 +02:00
degasus	ef6f6a7fa9	VideoCommon: remove XFReg copy optimization This code is just ugly and I doubt there is a way that copying twice is faster.	2014-09-04 17:56:17 +02:00
skidau	86db0bf8c3	Added the EmuRunningState check to the GPU thread's FIFO loop so that the GPU thread services any waiting save states. This is needed for games that have the Sync GPU option enabled.	2014-09-04 22:02:21 +10:00
comex	dd5be7c0dc	Merge pull request #924 from comex/fifo-command-runnable Refactor opcode decoding a bit to kill FifoCommandRunnable.	2014-09-02 23:27:30 -04:00
Shawn Hoffman	266992684d	msvc: remove some remnants of SDL and DSound from projects and general cleanup.	2014-09-01 21:27:44 -07:00
Fiora	b51aa4fa89	Rename Log2 and add IsPow2 to MathUtils for future use Also remove unused pow2/pow2f functions.	2014-09-01 20:41:07 -07:00
comex	608f9bcd67	Refactor opcode decoding a bit to kill FifoCommandRunnable. Separated out from my gpu-determinism branch by request. It's not a big commit; I just like to write long commit messages. The main reason to kill it is hopefully a slight performance improvement from avoiding the double switch (especially in single core mode); however, this also improves cycle calculation, as described below. - FifoCommandRunnable is removed; in its stead, Decode returns the number of cycles (which only matters for "sync" GPU mode), or 0 if there was not enough data, and is also responsible for unknown opcode alerts. Decode and DecodeSemiNop are almost identical, so the latter is replaced with a skipped_frame parameter to Decode. Doesn't mean we can't improve skipped_frame mode to do less work; if, at such a point, branching on it has too much overhead (it certainly won't now), it can always be changed to a template parameter. - FifoCommandRunnable used a fixed, large cycle count for display lists, regardless of the contents. Presumably the actual hardware's processing time is mostly the processing time of whatever commands are in the list, and with this change InterpretDisplayList can just return the list's cycle count to be added to the total. (Since the calculation for this is part of Decode, it didn't seem easy to split this change up.) To facilitate this, Decode also gains an explicit 'end' parameter in lieu of FifoCommandRunnable's call to GetVideoBufferEndPtr, which can point to there or to the end of a display list (or elsewhere in gpu-determinism, but that's another story). Also, as a small optimization, InterpretDisplayList now calls OpcodeDecoder_Run rather than having its own Decode loop, to allow Decode to be inlined (haven't checked whether this actually happens though). skipped_frame mode still does not traverse display lists and uses the old fake value of 45 cycles. degasus has suggested that this hack is not essential for performance and can be removed, but I want to separate any potential performance impact of that from this commit.	2014-09-01 14:35:23 -04:00
Pierre Bourdon	494a60e41b	VertexLoader: Change VtxDesc to use u64 instead of u32 This is required to make packing consistent between compilers: with u32, MSVC would not allocate a bitfield that spans two u32s (it would leave a "hole").	2014-09-01 11:18:02 +02:00
Lioncash	4af8d9d248	VideoCommon: Clean up brace placements	2014-08-30 18:06:45 -04:00
Lioncash	1d706b2311	Get rid of C-style empty function parameter indicators	2014-08-30 15:23:48 -04:00
comex	a4a533e39f	Re-enable the vertex loader JIT on OS X. Why was it ever disabled?	2014-08-27 23:50:59 -04:00
comex	e31d6feaa2	Unify three types of non-FIFO requests to the GPU thread around Common::Event and Common::Flag. The only possible functionality change is that s_efbAccessRequested and s_swapRequested are no longer reset at init and shutdown of the OGL backend (only; this is the only interaction any files other than MainBase.cpp have with them). I am fairly certain this was entirely vestigial. Possible performance implications: efbAccessReady now uses an Event rather than spinning, which might be slightly slower, but considering the slow loop the flags are being checked in from the GPU thread, I doubt it's noticeable. Also, this uses sequentially consistent rather than release/acquire memory order, which might be slightly slower, especially on ARM... something to improve in Event/Flag, really.	2014-08-26 12:43:39 -04:00

1 2 3 4 5 ...

2099 Commits