pcsx2

Commit Graph

Author	SHA1	Message	Date
Gregory Hainaut	3f404c8edb	gsdx-ogl: update shader pipeline intertace to use vs/gs/ps triplet Better to have 1 function calls with 3 parameters rather 3 functions call with 1 parameter.	2016-04-10 14:14:30 +02:00
Gregory Hainaut	f751f70b1e	gsdx ogl: GL_ARB_clip_control is now mandatory	2016-04-07 21:57:54 +02:00
Gregory Hainaut	f8c442cf76	gsdx-ogl: make VS more generic Texture coordinate could be dummy/float/int integral/int normalized. Old behavior: * VS was in charge to select the texture coordinate * int integral format wasn't supported New behavior: * Always compute all formats * FS will be in charge to select the good format Impact: * VS will be slightly slower but it reduces shaders permutation from little to 0 (won't be bad for CPU) * FS speed isn't impacted as 2 separate code paths were already required to support both format * Rasterizer will be 33% slower but unlikely to be the limited factor of the GPU * In future we could directly use the integral format in the FS. V2: remove useless PSin_t	2016-02-18 19:02:05 +01:00
Gregory Hainaut	5cbd0cf42a	gsdx-debug: add more push/pop debug group on the init	2015-10-24 14:14:37 +02:00
Gregory Hainaut	0958b9db8e	gsdx-ogl: use the standard openGL name Function pointer was mangled to avoid any collision. Nowadays all symbols are hidden so no risk of collision. Syntax is nicer beside it would allow to put back GLES3.2. I think it supports most of the used extension. glActiveTexture & glBlendColor are provided without symbol query.	2015-10-17 17:05:15 +02:00
Gregory Hainaut	921b22ac31	gsdx-ogl: drop useless parameter of OMSetDepthStencilState	2015-08-05 22:55:12 +02:00
Gregory Hainaut	b7e16b5989	gsdx-ogl: clean the blending management Intially GSBlendStateOGL was an alias of the m_blendMapD3D9 array The object was replaced by an index in the array. Save 2k of memory duplication. And too much useless code. v2: push/pop blending state in DATE stuff v3: remove m_state which is useless now	2015-08-05 22:55:12 +02:00
Gregory Hainaut	6e1cd077a9	gsdx-debug: push ubo in a group	2015-08-03 08:46:25 +02:00
Gregory Hainaut	97b38d9e1b	gsdx-ogl: directly set impossible mode in the blending table Avoid to hack it in the creation Allow in the future to reuse the table directly instead of converting in a blend object	2015-07-31 09:45:28 +02:00
Gregory Hainaut	8554f32086	gsdx-ogl: clean the blend table Remove old shader define Prefix macro with BLEND_ Add some notes to explain the special symbol	2015-07-31 09:45:28 +02:00
Gregory Hainaut	cfd0fd6cc8	gsdx-ogl: remove old colclip algo	2015-07-31 09:45:28 +02:00
Gregory Hainaut	93c47feb7c	gsdx-ogl: replace old colclip algo with the HDR algo Similar speed but more accurate Allow to clean the code	2015-07-31 09:45:28 +02:00
Gregory Hainaut	83f874db93	gsdx-ogl: remove bsel.ps Just clear bsel.abe to disable blending	2015-07-31 09:45:28 +02:00
Gregory Hainaut	83dfc6b633	gsdx-ogl: clean a bit selector code Use countof macro (avoid to duplicate the size) Fix the size of array Remove useless alpha_stencil case	2015-07-30 18:36:05 +02:00
Gregory Hainaut	ae8df002af	gsdx-ogl: optimize Cs * As + Cd and Cs * Af + Cd blending Basically the code does the alpha multiplication in the shader therefore the blend unit only does a pure addition. This way the multiplication is accurate and accurate_blending doesn't requires a costly barrier. This code also avoid variable duplication to make the code more separated. Hopefully blending can be done in a separated function It is preliminary work to support fast color clipping with HDR v2: fix assertion compilation failure v3: fix regression in not accurate mode v3: Cs * As/Af is not an accumulation Those cases don't need the Cd addition and were already optimized anyway Fix a regression on GoW2	2015-07-30 18:22:59 +02:00
Gregory Hainaut	8c8fe633a5	gsdx-ogl: merge 3 accurate* option into a nice combobox It is much easier to configure this way	2015-07-30 18:21:00 +02:00
Gregory Hainaut	b4c04ed00a	gsdx-ogl: wipeout subroutine code Code was completey bitrotten Code was a partial test (and yet 500 lines already) Shader is more and more complex and multithreading support greatly reduce the cost of shader switch	2015-07-17 21:08:49 +02:00
Gregory Hainaut	2ccf108534	gsdx-ogl: add back a selector for the Geometry Shader	2015-07-11 14:35:34 +02:00
Gregory Hainaut	621d1d93db	gsdx-linux: drop the logz option logz will only enabled if GL_ARB_clip_control is not found. Note: logz will likely break depth support. Be sure to use a recent driver	2015-07-03 21:29:49 +02:00
Gregory Hainaut	9ee3a173d0	gsdx-ogl: use a local ALPHA register It would allow to easy tune the parameter to support 24 bits format	2015-05-26 15:36:48 +02:00
Gregory Hainaut	7f614401a6	gsdx-ogl: set accurate_blend as default Rendering is better and I think speed impact remains small.	2015-05-23 15:10:04 +02:00
Gregory Hainaut	8d3e3e6c5b	gsdx-ogl: more blend rework to support accurate_colclip So far few blending equations are implemented in PS. It is only for test the behavior on GoW	2015-05-20 08:00:40 +02:00
Gregory Hainaut	c5341a2711	gsdx-ogl: update blending management This way it will allow to implement all blendings operartion in FS. Of course it will be slow, but it would be nice for debug and quickly check game error rendering.	2015-05-20 00:12:52 +02:00
Gregory Hainaut	540717b295	gsdx-ogl: don't force logz only on debug build	2015-05-18 07:24:26 +02:00
Gregory Hainaut	e0012811ae	gsdx-debug: more debug info in gl trace	2015-05-12 12:36:34 +02:00
Gregory Hainaut	bc5de8df6b	gsdx-ogl: disable FF blend when blend is done in shader	2015-05-08 20:28:50 +02:00
Gregory Hainaut	380e420cdd	gsdx-ogl: add a blend parameter to shader	2015-05-08 20:28:50 +02:00
Gregory Hainaut	e87d129b09	gsdx-ogl: more verbose debug	2015-05-08 01:01:01 +02:00
Gregory Hainaut	530e4ce776	gsdx-ogl: drop hack that rescale primitive (to avoid upscale glitch) Rendering is bad. It renders sprites at native resolution.	2015-05-06 19:09:13 +02:00
Gregory Hainaut	8032e2c369	gsdx-ogl: separate color mask state from the blending state Unlike DX they're uncorrelated.	2015-05-05 10:26:01 +02:00
Gregory Hainaut	e17ba1c143	gsdx ogl: disable logz when ARB_clip_control is supported Basically any recent drivers but Catalyst...	2015-05-01 15:01:08 +02:00
Gregory Hainaut	2bd9043657	gsdx-ogl: improve debug of stencil Note: ENABLE_OGL_STENCIL_DEBUG could be dropped now that I can read the stencil properly	2015-04-30 19:55:57 +02:00
Gregory Hainaut	31f8c065db	gsdx-ogl: implement a new hack UserHacks_UnscaleSprite for opengl UserHacks_UnscaleSprite = 1 will unscale flat sprites UserHacks_UnscaleSprite = 2 will unscale all sprites (don't work well so far) The idea of the hack is to redo the interpolation of texture coordinate based on the non-upscaled pixel position. It avoids various glitches but sprites aren't upscaled anymore (so no more anti-aliasing, potentially a coefficient can be added).	2015-04-20 07:18:08 +02:00
Gregory Hainaut	6124eb844e	gsdx-ogl: only compile useful VS logz is a constant wildhack is only compatbile with TME/FST Compilation goes down from 64 to 20 vertex shaders.	2015-04-20 07:17:58 +02:00
Gregory Hainaut	ccc1137e12	gsdx-ogl: merge the two vertex buffer format * Only a single VAO => Format is set once => Only a single bind at startup => GSVertexBufferStateOGL is nearly useless => barely faster but better than nothing :)	2014-10-02 20:44:22 +02:00
Gregory Hainaut	4659184cc1	gsdx-ogl: add support of clip_control (depth only) * replace the [-1;1] depth range of openGL with the DX range [0;1].	2014-09-28 12:00:34 +02:00
Gregory Hainaut	b020bd76c6	gsdx-ogl: restore gles build Add the --gles build option to the linux main script Ifdef all gl code not supported on gles3 (note some will be reenabled for gles3.1) Note: it probably doesn't run anymore. My Nvidia driver doesn't support yet egl/gles so I can't test it. Feel free to contribute.	2014-03-29 11:55:02 +01:00
gregory.hainaut	384c0c12ea	gsdx ogl: * properly detect gl nv depth extension * Always show the hack on the gui. Add a new hack option for DATE (gl4.2) only * Save the scan mode on linux too (f7) * hopefully fix some crash on some drivers... (ensure aligment 256 bits alignment, and if not use std memcpy) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5888 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-07 19:53:01 +00:00
gregory.hainaut	84895eadd9	gsdx ogl: correct most of Z-depth issue Best setting if you driver support GL_NV_depth_buffer_float => GL_NV_Depth = 1 & logz = 0 Otherwise => GL_NV_Depth = 0 & logz = 1 Explanation of the bug: Dx z position ranges from 0.0f to 1.0f (FS ranges 0.0f to 1.0f) GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f) Why it sucks: GS small depth value will be "mapped" to -1.0f. In others all small values will be 1.0f! Terrible lost of accuraccy. The GL_NV_depth_buffer_float extension allow to set the near plane as -1.0f. So "GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f)" will become "GL z Position ranges from -1.0f to 1.0f (FS ranges -1.0f to 1.0f)" and therefore "z posision [0.0f;1.0f] will map to FS [0.0f;1.0f]" as DX Yes we just get back all precision lost previously :) However you need hardware (intel?) and driver support (free driver?/gles?) :( git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5860 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-31 21:30:54 +00:00
gregory.hainaut	e80b002929	gsdx ogl: Flush various pending work * try to use more subroutine on VS&PS, unfortunately hit a driver crash! * Call Attach/DetachContext through GSDevice so I can unmap currently mapped buffer * Implement glsl part of GL_ARB_bindless texture, again hit another driver crash! * various fix of GL_ARB_buffer_storage. Basic benchmark show only improvement on 'cold' case, I guess it will improve smoothness * try to fix GL_clear_texture, no success so far. It seem the extension is limited to basic texture (aka no depth/stencil) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5752 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-10-24 20:54:27 +00:00
gregory.hainaut	e01c6cd9ce	gsdx ogl: the proof of concept commit * GL_ARB_shader_subroutine for perf fix for nvidia => add missing shader declaration. Nvidia got +4fps on colin3 :) For the moment only 2 PS parameters are supported. Code need to be extended to support others games that often switch shader program (like xenosaga). require GL4 class hardware and the option override_GL_ARB_shader_subroutine = 1 Note: strangely on AMD linux it is slower! * GL_ARB_shader_image_load_store for accuraccy (Date) Use a signed integer texture and reenable color buffer writing Current status: Amagami_transparency.gs & P3_battle_shadows.gs are now working on Nvidia with a small perf impact. Current implementation detail: 1/ setup the standard stencil as before 2/ on remaining pixel, draw once to compute first primitive that will write a fail alpha value. 3/ final draw based on primitive id of step 2 Note: I think we would get a bad behavior if depth test&mask are enabled on step 2/3 Note2: on my limited testcase the perf impact was on CPU. It would be possible to merge step1&2 to nullifying it (could even be faster actually), however it would require more GPU power. Again require GL4 class hardware. And the option UserHacks_DateGL4 = 1 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5725 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-28 08:44:16 +00:00
gregory.hainaut	07605941ef	gsdx ogl: * some preliminary work to test/benchmark bindless texture in the future (glsl was not yet updated) Bindless texture allow to get a GPU texture pointer and then set it directly to the shader as a basic uniform. => no more texture unit selection/validation => no more texture validation neither texture hash lookup 3rdparty: update gl header to the latest gl4.4 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5720 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-17 08:57:52 +00:00
gregory.hainaut	b4084047be	gsdx ogl: Used a basic flat interpolation for color interpolation (line & tri primitives) Card that support gs: remain only a gs to generate sprite from a line. Even dummy gs are costly for the GPU. Card that don't support gs: remove useless copy of color for line and triangle primitives Note for dx: opengl 3.2 (maybe not gles) supports both flat interpolation convention (GL_FIRST_VERTEX_CONVENTION or GL_LAST_VERTEX_CONVENTION). It might be possible to shuffle vertex index to put the last vertex in first position. - buff[0] = head + 0; - buff[1] = head + 1; - buff[2] = head + 2; + buff[0] = head + 2; + buff[1] = head + 1; + buff[2] = head + 0; git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5718 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-14 10:18:38 +00:00
gregory.hainaut	0f603a98d5	gsdx ogl: Test the ARB_shader_subroutine GL4.0 extension The idea was to replace shader program swith by pointer function calls inside shaders. At least parameters that are often changed between draw call. So far I only ported atst and colclip. Unfortunately code is "slower" (on GSdx standalone). For the moment keep the code but disabled. If I understand well the validation of program is done in the "driver thread" but the additional call are done in the overloaded MTGS thread. Apitrace profiling shows faster GPU draw calls. Another possibility is that the driver still need to validate the draw call because of others state change. Here some stats on colin3 (90 frames): without subroutine: UseProgram 125246 with subroutine: UseProgram 2906, subroutine 125945 => 3605 extra calls overhead (not all parameters are ported to subroutine) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5715 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-10 19:43:59 +00:00
gregory.hainaut	a46b489a24	gsdx ogl: various minor optimization. * move most of gl states into a separate namespace. Extend it to depth/stencil/blend micro state => save 10,000 opengl call by frame for colin mcrae 3 * Only setup blend state of first drawbuffer * Don't request anymore a debug context on dev/release build git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5713 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-05 20:25:25 +00:00
gregory.hainaut	64f783410e	gsdx ogl: * preliminary work for GL4.4 extensions (ARB_clear_texture & ARB_multi_bind). Disabled until I got a 4.4 driver Note: I plan also to use ARB_buffer_storage * compute texture gl option in the constructor (avoid a couple of swith case) * redo texture unit management. Unit 0-2 for shaders, Unit 3 for texture operations. MultiBind will allow to bind shader input without disturbing texture binding points. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5711 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-02 16:38:12 +00:00
gregory.hainaut	d3e1e4da0d	gsdx ogl: * allow to switch renderer with F9 * skip first frame in stat of the replayer * drop msaa. Fxaa and internal resolution will do the job * move texture attachment from texture object into device object (allow to keep sanely the state) * split the write buffer and attachment setup * completely split sampler and texture input setup * redo GSDeviceOGL::CopyOffscreen to avoid an extra copy. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5704 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-19 19:25:50 +00:00
gregory.hainaut	f22b366cea	gsdx: * redo glsl2h.pl script to generate only one big glsl headers. * fix gcc warning in GSVector.h * fix memory leak of GSDeviceOGL.m_shader * clean shader compilation function => split generation header & drop malloc stuff cmake: * only rebuild shader when asked by the use. Avoid perl dependency to build pcsx2 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5698 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-07 16:13:11 +00:00
gregory.hainaut	dc036ff59d	gsdx: split shader/program/pipeline object management into a separate class * remove the possibility to compile shader from file. Some people loads older shaders... The feature might be readded later git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5697 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-06 10:08:52 +00:00
gregory.hainaut	ca1edbf2cb	gsdx ogl: * Separate state and shader compilation into separate function * replace various hash_map by basic array * Compact VertexScale and offset into a single vec4 * add the new option "ogl_vertex_subdata": subdata is faster on FGLRX, test are welcome on Nvidia drivers 0 => use map/unmap 1 => use subdata replay: add "linux_replay" option and compute some nice stat (mean, standard deviation) cmake: recreate shader header at build time git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5682 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-26 20:09:07 +00:00

1 2

72 Commits