pcsx2

Commit Graph

Author	SHA1	Message	Date
Gregory Hainaut	1addae1993	gsdx-ogl: don't use extra shader for sprite hack Atst == 2 && sprite_hack is equivalent to Atst == 1 It frees a bit in the shader selector, and reduces shader combinations.	2015-05-08 19:28:16 +02:00
Gregory Hainaut	530e4ce776	gsdx-ogl: drop hack that rescale primitive (to avoid upscale glitch) Rendering is bad. It renders sprites at native resolution.	2015-05-06 19:09:13 +02:00
Gregory Hainaut	ae70344fbc	gsdx-ogl: fix 2 passes DATE with GL42 (override_GL_ARB_shader_image_load_store = 1) It fixes shadows issue with Persona 3	2015-05-02 16:56:18 +02:00
Gregory Hainaut	73d04e33e9	gsdx ogl: clean various comment and old code	2015-05-01 20:04:23 +02:00
Gregory Hainaut	335695bd0e	purge GLES from GSdx ! mobile will use vulkan (or any new API) anyway	2015-05-01 20:02:17 +02:00
Gregory Hainaut	71e517108d	gsdx-ogl: fix colclip Sotc shadow and likely various games :) Thanks to pseudo	2015-04-30 23:08:05 +02:00
Gregory Hainaut	b65a62096f	gsdx-ogl: drop support of ENABLE_OGL_DEBUG Stencil can be read by GL debugger due to correct mask configuration	2015-04-30 20:02:51 +02:00
Gregory Hainaut	eaa0dae305	glsl: drop invariant qualifier Fix issue #510 and potentially it allows the compiler to better optimize the code	2015-04-27 18:52:55 +02:00
Gregory Hainaut	46ff4dc3d3	gsdx-ogl: hardware unit only support normalization of 4 bytes... (At least on recent AMD GPU)	2015-04-27 18:51:59 +02:00
Gregory Hainaut	301fe869c7	glsl: improve the logz handling This way only 0 and 1 will conflict	2015-04-26 17:51:33 +02:00
Gregory Hainaut	6a3250cad2	gsdx-ogl: fix logz management (FFXII) The issue is that z+1 overflow if z is UINT max log2(z) overflow if z is 0 So let's use an or 1 to avoid both overflow	2015-04-25 20:23:26 +02:00
Gregory Hainaut	15ae9996bb	glsl: format white space + comment	2015-04-25 12:50:12 +02:00
Gregory Hainaut	36514bd95f	glsl: fog is a single byte Give a chance to the driver to optimize if possible	2015-04-24 21:37:37 +02:00
Gregory Hainaut	c207632e49	gsdx-ogl: improve date performance for GL45 If there is no overlap, it is allowed to directly read from the render target. On SotC testcase with 6x scaling: 30fps -> 40fps Note: it requires GL_ARB_texture_barrier extension so be sure to have a recent driver Note2: it requires a lots of testing too Open question: in case of complex date (written alpha) Will it be faster to split the draw call into multiple call with no primitive overlap	2015-04-24 21:12:33 +02:00
Gregory Hainaut	795ae50ecd	gsdx-ogl: fix the recently broken advance date feature Now it is really working with a 2 stages shaders but it is still slow.	2015-04-24 20:13:38 +02:00
Gregory Hainaut	89d5e5637c	glsl: use an explicit cast insead of notEqual function If the compiler didn't optimize the code, it will be a bit faster	2015-04-24 18:01:25 +02:00
Gregory Hainaut	56836561f4	glsl: replace runtime condition by preprocessor condition It might be an easier work for the compiler I didn't replace all occurences to keep readability	2015-04-24 17:51:29 +02:00
Gregory Hainaut	62489f42f1	gsdx-ogl: add an optimization note for later Only 1 byte of fog is useful	2015-04-20 07:18:09 +02:00
Gregory Hainaut	6d253c0b8f	glsl: fix debugging of tex coordinate in apitrace	2015-04-20 07:18:08 +02:00
Gregory Hainaut	31f8c065db	gsdx-ogl: implement a new hack UserHacks_UnscaleSprite for opengl UserHacks_UnscaleSprite = 1 will unscale flat sprites UserHacks_UnscaleSprite = 2 will unscale all sprites (don't work well so far) The idea of the hack is to redo the interpolation of texture coordinate based on the non-upscaled pixel position. It avoids various glitches but sprites aren't upscaled anymore (so no more anti-aliasing, potentially a coefficient can be added).	2015-04-20 07:18:08 +02:00
Gregory Hainaut	15264c6c63	glsl: split the main shader * separate VS/GS and FS * separate subroutine part of the FS It already complex enough without subroutine stuff. Besides I'm not sure we will keep subroutine on the future.	2015-04-19 18:49:02 +02:00
Gregory Hainaut	418f2e69a8	gsdx-ogl: implement the wildhack on the GPU Likely much faster for opengl and much easier to implement Note: hopefully UserHacks_round_sprite_offset will replace it	2015-04-13 22:14:36 +02:00
Gregory Hainaut	05eb1641f2	gsdx: add glsl debug capabilities Useful to debug bad texture sampling	2015-04-08 10:15:25 +02:00
Gregory Hainaut	e40a26be19	gsdx: add some helper to debug openGL renderer	2015-04-03 20:28:44 +02:00
Gregory Hainaut	29a14f5667	sed -i -e 's/seperately/separately/' Thanks micove again	2014-12-07 12:46:16 +01:00
Gregory Hainaut	16284d0a59	gsdx: fix a couple of typo From @micove	2014-12-07 12:12:53 +01:00
Gregory Hainaut	d21e6ff45f	gsdx-ogl: don't declare a sampler variable Nvidia was complaining on the fxaa shader. In doubt I also replaced it in Asmodean's shader	2014-11-20 22:07:44 +01:00
Gregory Hainaut	8c90e7cafc	gsdx-ogl: support latest fxaa version Only tested on Nvidia, please report any issue with your driver Note: requires GL4 GPU	2014-11-10 10:39:55 +01:00
Gregory Hainaut	84f844767c	gsdx-ogl: micro optimize PSConstantBuffer cache Might help to save a cache line on the CPU :)	2014-11-08 21:39:17 +01:00
Gregory Hainaut	ccc1137e12	gsdx-ogl: merge the two vertex buffer format * Only a single VAO => Format is set once => Only a single bind at startup => GSVertexBufferStateOGL is nearly useless => barely faster but better than nothing :)	2014-10-02 20:44:22 +02:00
Gregory Hainaut	d37cc8e1e7	gsdx-ogl: GL_ARB_shading_language_420pack is now mandatory All API/drivers support it so time to remove the fallback.	2014-10-02 20:44:22 +02:00
Gregory Hainaut	594f6c33a2	gsdx-ogl-ES: require GL_EXT_shader_io_blocks + GLES3.1 Allow to use same shader interface for all API Note: on the GL API it will require GL3.3 (see next commit)	2014-09-30 22:18:01 +02:00
Gregory Hainaut	4659184cc1	gsdx-ogl: add support of clip_control (depth only) * replace the [-1;1] depth range of openGL with the DX range [0;1].	2014-09-28 12:00:34 +02:00
Gregory Hainaut	0d45e6d70e	gsdx-ogl: avoid to send constant to the GPU It was a waste of bandwith	2014-04-06 10:44:40 +02:00
gregory.hainaut	384c0c12ea	gsdx ogl: * properly detect gl nv depth extension * Always show the hack on the gui. Add a new hack option for DATE (gl4.2) only * Save the scan mode on linux too (f7) * hopefully fix some crash on some drivers... (ensure aligment 256 bits alignment, and if not use std memcpy) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5888 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-07 19:53:01 +00:00
gregory.hainaut	6f3326a9a4	gsdx ogl: Fog hide this typo :p git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5861 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-01 10:14:01 +00:00
gregory.hainaut	84895eadd9	gsdx ogl: correct most of Z-depth issue Best setting if you driver support GL_NV_depth_buffer_float => GL_NV_Depth = 1 & logz = 0 Otherwise => GL_NV_Depth = 0 & logz = 1 Explanation of the bug: Dx z position ranges from 0.0f to 1.0f (FS ranges 0.0f to 1.0f) GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f) Why it sucks: GS small depth value will be "mapped" to -1.0f. In others all small values will be 1.0f! Terrible lost of accuraccy. The GL_NV_depth_buffer_float extension allow to set the near plane as -1.0f. So "GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f)" will become "GL z Position ranges from -1.0f to 1.0f (FS ranges -1.0f to 1.0f)" and therefore "z posision [0.0f;1.0f] will map to FS [0.0f;1.0f]" as DX Yes we just get back all precision lost previously :) However you need hardware (intel?) and driver support (free driver?/gles?) :( git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5860 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-31 21:30:54 +00:00
gregory.hainaut	c2aa4ff3fd	gsdx ogl: * restore the old fxaa (Asmodeam will be integrated when I got time) * port the recently added new scanline algo git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5818 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-18 14:46:13 +00:00
gregory.hainaut	23ae6f0641	the regular copy/past mistake git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5811 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 12:35:15 +00:00
gregory.hainaut	2238095a82	gsdx ogl: do the same as previous commit but for ogl ;) * fix the missing auto interlace opt when cycling with hotkey on linux git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5810 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 11:38:50 +00:00
gregory.hainaut	f5257b2c3c	gsdx ogl: don't rely on GL_ES builtin (3 differents implementations...). Hopefully fix Linux Intel GPU. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5808 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-11 15:01:13 +00:00
gregory.hainaut	e80b002929	gsdx ogl: Flush various pending work * try to use more subroutine on VS&PS, unfortunately hit a driver crash! * Call Attach/DetachContext through GSDevice so I can unmap currently mapped buffer * Implement glsl part of GL_ARB_bindless texture, again hit another driver crash! * various fix of GL_ARB_buffer_storage. Basic benchmark show only improvement on 'cold' case, I guess it will improve smoothness * try to fix GL_clear_texture, no success so far. It seem the extension is limited to basic texture (aka no depth/stencil) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5752 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-10-24 20:54:27 +00:00
gregory.hainaut	e01c6cd9ce	gsdx ogl: the proof of concept commit * GL_ARB_shader_subroutine for perf fix for nvidia => add missing shader declaration. Nvidia got +4fps on colin3 :) For the moment only 2 PS parameters are supported. Code need to be extended to support others games that often switch shader program (like xenosaga). require GL4 class hardware and the option override_GL_ARB_shader_subroutine = 1 Note: strangely on AMD linux it is slower! * GL_ARB_shader_image_load_store for accuraccy (Date) Use a signed integer texture and reenable color buffer writing Current status: Amagami_transparency.gs & P3_battle_shadows.gs are now working on Nvidia with a small perf impact. Current implementation detail: 1/ setup the standard stencil as before 2/ on remaining pixel, draw once to compute first primitive that will write a fail alpha value. 3/ final draw based on primitive id of step 2 Note: I think we would get a bad behavior if depth test&mask are enabled on step 2/3 Note2: on my limited testcase the perf impact was on CPU. It would be possible to merge step1&2 to nullifying it (could even be faster actually), however it would require more GPU power. Again require GL4 class hardware. And the option UserHacks_DateGL4 = 1 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5725 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-28 08:44:16 +00:00
gregory.hainaut	07605941ef	gsdx ogl: * some preliminary work to test/benchmark bindless texture in the future (glsl was not yet updated) Bindless texture allow to get a GPU texture pointer and then set it directly to the shader as a basic uniform. => no more texture unit selection/validation => no more texture validation neither texture hash lookup 3rdparty: update gl header to the latest gl4.4 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5720 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-17 08:57:52 +00:00
gregory.hainaut	b4084047be	gsdx ogl: Used a basic flat interpolation for color interpolation (line & tri primitives) Card that support gs: remain only a gs to generate sprite from a line. Even dummy gs are costly for the GPU. Card that don't support gs: remove useless copy of color for line and triangle primitives Note for dx: opengl 3.2 (maybe not gles) supports both flat interpolation convention (GL_FIRST_VERTEX_CONVENTION or GL_LAST_VERTEX_CONVENTION). It might be possible to shuffle vertex index to put the last vertex in first position. - buff[0] = head + 0; - buff[1] = head + 1; - buff[2] = head + 2; + buff[0] = head + 2; + buff[1] = head + 1; + buff[2] = head + 0; git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5718 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-14 10:18:38 +00:00
gregory.hainaut	0f603a98d5	gsdx ogl: Test the ARB_shader_subroutine GL4.0 extension The idea was to replace shader program swith by pointer function calls inside shaders. At least parameters that are often changed between draw call. So far I only ported atst and colclip. Unfortunately code is "slower" (on GSdx standalone). For the moment keep the code but disabled. If I understand well the validation of program is done in the "driver thread" but the additional call are done in the overloaded MTGS thread. Apitrace profiling shows faster GPU draw calls. Another possibility is that the driver still need to validate the draw call because of others state change. Here some stats on colin3 (90 frames): without subroutine: UseProgram 125246 with subroutine: UseProgram 2906, subroutine 125945 => 3605 extra calls overhead (not all parameters are ported to subroutine) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5715 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-10 19:43:59 +00:00
gregory.hainaut	64f783410e	gsdx ogl: * preliminary work for GL4.4 extensions (ARB_clear_texture & ARB_multi_bind). Disabled until I got a 4.4 driver Note: I plan also to use ARB_buffer_storage * compute texture gl option in the constructor (avoid a couple of swith case) * redo texture unit management. Unit 0-2 for shaders, Unit 3 for texture operations. MultiBind will allow to bind shader input without disturbing texture binding points. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5711 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-02 16:38:12 +00:00
gregory.hainaut	e394de86dc	gsdx ogl: * add a non-working hack: UserHacks_DateGL4, goal was to replace UserHacks_AlphaStencil + Detection of good/bad samples is based on primitive ID variable. However I'm not sure the behavior is always the same between draw call...Anyway let's keep a copy of the current work * Dump integer texture into text csv * add gl4.2 ARB_shader_image_load_store extension (needed by UserHacks_DateGL4) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5707 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-28 14:40:43 +00:00
gregory.hainaut	b912778a3c	gsdx: forget to regenerate shader in previous commit git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5702 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-13 16:31:19 +00:00
gregory.hainaut	bd5f379044	gsdx ogl: * uniform was wrongly set before the activation of the program (free driver only) * Always use only 1 drawbuffer. Easier besides previous setup was wrong for convert:ps_main1 GLES trial (v3): * add the cmake option GLES_API. Note library (libgles) are hardcoded for the moment * Disable opengl check * Disable gl_GetDebugMessageLogARB not supported! * Emulate gl_DrawElementsBaseVertex, add manually the index offset (surely slow but work) * Fix hundred of shader error (no implicit cast of integer to float...) Unfortunately GLES doesn't support dual blending so no blend in hardware renderer. Otherwise it is fine git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5700 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-12 21:12:34 +00:00
gregory.hainaut	a764950468	gsdx ogl: * replace vertex interface with block interface. It avoid to depends on the ARB_sso extension. * disable geometry shader on Nvidia & Linux. Slower but better than a black screen ! * default logz to 1, avoid some glitches. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5699 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-11 17:08:42 +00:00
gregory.hainaut	f22b366cea	gsdx: * redo glsl2h.pl script to generate only one big glsl headers. * fix gcc warning in GSVector.h * fix memory leak of GSDeviceOGL.m_shader * clean shader compilation function => split generation header & drop malloc stuff cmake: * only rebuild shader when asked by the use. Avoid perl dependency to build pcsx2 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5698 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-07 16:13:11 +00:00

1 2 3

102 Commits