xenia-canary

Commit Graph

Author	SHA1	Message	Date
Radosław Gliński	23ca3725c4	Merge pull request #50 from chrisps/canary_experimental Ton of cpu changes	2022-07-16 19:39:45 +02:00
chss95cs@gmail.com	6a612b4d34	remove useless tag field from hir::Value pack local_slot and constant in hir::Value Instead of loading membase at the start of every function, just load it in HostToGuestThunk vzeroupper in GuestToHostThunk before calling host function, and in HostToGuestThunk after calling function to prevent AVX dirty state slowdowns. In the future, check if CPU implements AVX as 128x2 and skip if so (https://john-h-k.github.io/VexTransitionPenalties.html) Remove useless save/restore of ctx pointer, nothing modifies it and it prevents cpus from doing cross-function memory renaming (https://www.agner.org/forum/viewtopic.php?t=41). Could not remove the space on stack because of alignment issues, instead turned it into GUEST_SCRATCH64 which is a temporary that sequences may use Reorder OpcodeInfo so that name is at offset 0, remove name and add GetOpcodeName function (name is only used for debug code, we are seperating frequently accessed data and rarely accessed data) Add VECTOR_DENORMFLUSH opcode for handling output to DOT_PRODUCT and other opcodes that implicitly force denormal inputs/outputs to zero, will eventually use for implementing NJM Rewrite sequences for LOAD_VECTOR_SHL/SHR. The mask with 0xf in it was pointless as all InstrEmit_ functions that create the load shift instructions do that in HIR. The tables are only used for nonzero constant inputs now, which are probably pretty rare. Instead of doing a shift and lookup, a base value is used for both in the constant table and adding/subtracting of the input is done Reuse result of LoadVectorShl/Shr in InstrEmit_stvlx_, InstrEmit_stvrx_. We were previously calculating it twice which was contributing to the final sequences' fatness. Use OPCODE_SELECT instead of the sequence of or, andnot, and that it was using for merging Add the proper unconditional denormal input flushing behavior to vfmadd, add it also to vfmsub (making the assumption it has the same behavior) Remove constant propagation for DOT_PRODUCT_3/4 DOT_PRODUCT_3/4 now returns a vector with all four elements set to the result. (what we were doing before, truncating to float32 and then splatting didnt make any sense) Add much more correct versions of DOT_PRODUCT_3/4, matching the Xb360's to 1 bit. Still needs work to be a perfect emulation. Add constant folding for OPCODE_SELECT, OPCODE_INSERT, OPCODE_PERMUTE, OPCODE_SWIZZLE Remove constant folding for DOT_PRODUCT Removed the multibyte nop code I committed earlier, it doesnt help us much because nops are only used for debug stuff and its ugly and wouldnt survive in a pr to main Check for AVX512BMI, use vpermb to shuffle if supported	2022-07-16 10:25:04 -07:00
Radosław Gliński	5f11c5d3b4	Merge pull request #49 from chrisps/canary_experimental Optimized GetScalarNZM, add limit to how far it can recurse.	2022-07-15 00:03:17 +02:00
chss95cs@gmail.com	71c5f8f0fa	Optimized GetScalarNZM, add limit to how far it can recurse. Add rlwinm elimination rule	2022-07-14 14:32:14 -07:00
Gliniak	1d00372e6b	Merge branch 'master' of https://github.com/xenia-project/xenia into canary_experimental	2022-07-10 10:50:39 +02:00
Triang3l	b41bb35a20	[SPIR-V] Make interpolators an array to fix Adreno linkage	2022-07-09 17:52:26 +03:00
Triang3l	b3edc56576	[Vulkan] Merge texture and sampler descriptors into a single descriptor set Put all descriptors used by translated shaders in up to 4 descriptor sets, which is the minimum required, and the most common on Android, `maxBoundDescriptorSets` device limit value	2022-07-09 17:10:28 +03:00
Triang3l	ff35a4b3a1	[Third-party] Revert premake-core downgrade caused by a merge	2022-07-09 13:43:53 +03:00
Gliniak	d33be73f3d	Fixed crash caused by hash calculation in specific cases	2022-07-08 08:49:43 +02:00
Triang3l	e4de8663c4	[Vulkan] All guest draw uniform buffer bindings in a single descriptor set Reduce the number of bound descriptor sets from 10 to 6, which is still above the minimum limit of 4, but closer	2022-07-07 21:05:56 +03:00
Triang3l	88c055eb30	[CPU] Null backend enough for GPU trace viewing	2022-07-06 23:28:06 +03:00
Triang3l	3ee68d79ea	Revert "[GPU] Make Processor optional for GraphicsSystem setup" The Processor is still required in many places, including the GPU command processor worker thread This reverts commit `fd03d886e9`.	2022-07-06 22:43:40 +03:00
Triang3l	2507837e8d	[Drone] Build cpu and gpu-trace-dump for Arm64	2022-07-06 21:34:51 +03:00
Triang3l	6852e54937	[CPU] Remove intrinsics from dot product constant propagation	2022-07-06 21:32:56 +03:00
Triang3l	326e718035	[CPU] MMIO: Arm64, load register writes + exception cleanup	2022-07-06 21:05:05 +03:00
Triang3l	fd03d886e9	[GPU] Make Processor optional for GraphicsSystem setup	2022-07-05 21:21:22 +03:00
Triang3l	bdfd410b13	[CPU] Cleanup x64 backend usage conditionals	2022-07-05 21:07:10 +03:00
Triang3l	c1efd560fb	[Drone] Enable newly buildable Android targets	2022-07-05 20:48:41 +03:00
Triang3l	d263d508cd	[GPU] Make operator< const	2022-07-05 20:47:53 +03:00
Triang3l	536f14d94c	[GPU] Fix a typo in a Neon intrinsic name	2022-07-05 20:47:34 +03:00
Triang3l	d51fafd07c	[Base] Linux Arm64 exception handler	2022-07-05 20:46:49 +03:00
Triang3l	2d5602447e	[Drone] Windowed apps are not yet ready for Android building (kind WindowedApp unsupported)	2022-07-04 21:28:39 +03:00
Triang3l	10819a4174	[Drone] Update Android targets	2022-07-04 23:53:55 +03:00
Triang3l	40aa73f7d7	[Linux] Swap read/write in x64 page fault handler + exception code cleanup	2022-07-04 23:51:26 +03:00
Triang3l	a9cbd9cc5f	[Linux] Update RIP after handling an exception	2022-07-04 23:24:26 +03:00
uytvbn	54aac81268	[Linux] Implement exception handler	2022-07-04 23:04:27 +03:00
Triang3l	35d4ea59c6	[Base] Remove exception_handler_linux.cc	2022-07-04 23:02:11 +03:00
Triang3l	feaad639fb	[Vulkan] Destroy all RTs before VulkanRenderTargetCache is destroyed	2022-07-04 11:27:51 +03:00
Gliniak	6e753c6399	Merge branch 'master' of https://github.com/xenia-project/xenia into canary_experimental	2022-07-04 08:11:04 +02:00
Triang3l	2621dabf0f	[Vulkan] Native 24-bit unorm depth where available	2022-07-03 21:21:17 +03:00
Triang3l	83e9984539	[Vulkan] Remove required feature checks Fallbacks for those will be added more or less soon, the stable version won't hard-require anything beyond 1.0 and the portability subset	2022-07-03 20:54:34 +03:00
Triang3l	bbae909fd7	[GPU] Reasons to keep non-Vulkan backends [ci skip]	2022-07-03 20:39:44 +03:00
Triang3l	ed61e15fc3	[App] Make D3D12 the default GPU backend on Windows again	2022-07-03 19:49:11 +03:00
Triang3l	ee84f4e267	[Vulkan] Update title bar warning	2022-07-03 19:45:48 +03:00
Triang3l	f7ef051025	[Vulkan] Disable validation by default	2022-07-03 19:42:22 +03:00
Triang3l	001f64852c	[Vulkan] VMA for textures	2022-07-03 19:40:48 +03:00
Gliniak	a8df744ea6	Merge branch 'master' of https://github.com/xenia-project/xenia into canary_experimental	2022-07-03 13:07:15 +02:00
Triang3l	636585e0aa	[Vulkan] Trace viewer	2022-07-01 19:53:41 +03:00
Triang3l	ad1ef84145	Merge branch 'master' into vulkan	2022-07-01 19:53:08 +03:00
Triang3l	e37e3ef382	[GPU] Display swap output in the trace viewer Resolve output is unreliable because resolving may be done to a subregion of a texture and even to 3D textures, and to any color format	2022-07-01 19:50:19 +03:00
Triang3l	c8a4a9504f	[Vulkan] Remove an unneeded scale from RefreshGuestOutput aspect ratio	2022-07-01 12:52:12 +03:00
Triang3l	d174762a40	Merge branch 'master' into vulkan	2022-07-01 12:51:34 +03:00
Triang3l	28670d8ec2	[UI] Presenter: Rename display size to aspect ratio	2022-07-01 12:50:45 +03:00
Triang3l	f8b351138e	[Vulkan] Alpha test	2022-06-30 22:20:51 +03:00
Triang3l	6772c88141	Merge branch 'master' into vulkan	2022-06-30 22:15:29 +03:00
Triang3l	7e691d5ef1	[DXBC] Handle NaN in not equal alpha test as passed	2022-06-30 22:15:01 +03:00
Triang3l	c0c3666e12	[Vulkan] Align texture extents in loading to vector size accessed by the shader Fixes loading of the 1x1 linear 8_8_8_8 texture containing just a single #FFFFFFFF texel in 4D5307E6, which is used for screen fade and the lobby map loading bar background	2022-06-29 23:41:32 +03:00
Triang3l	9392fff369	Merge branch 'master' into vulkan	2022-06-29 23:39:54 +03:00
Triang3l	a11b070fee	[GPU] Align texture extents in loading to host buffer texel size accessed by the shader	2022-06-29 23:38:06 +03:00
Triang3l	7c2df55209	[Vulkan] Cache clear: shared memory, scratch buffer	2022-06-29 13:24:45 +03:00

1 2 3 4 5 ...

6971 Commits All Branches Search

6971 Commits

All Branches