pcsx2

Commit Graph

Author	SHA1	Message	Date
Gregory Hainaut	78dd957717	gsdx-ogl: use normalized index coordinate for palette texture In palette mode, 90% of texture accesses are done in 8 bits. So let's keep this path as light as possible. It reduces GPU load.	2015-08-20 23:01:43 +02:00
Gregory Hainaut	6b84a89b6a	gsdx-ogl: remove old ati hack for point sampler Was never used on openGL In the future absolute coordinate will be use anyway	2015-08-18 19:09:45 +02:00
Miguel A. Colón Vélez	21c4e6caa6	Fix things found by lintian. I: pcsx2: spelling-error-in-binary usr/lib/i386-linux-gnu/pcsx2/libGSdx-1.0.0.so allows to allows one to I: pcsx2: spelling-error-in-binary usr/lib/i386-linux-gnu/pcsx2/libGSdx-1.0.0.so Allow to Allow one to . Apparently lintian checks grammar too (most common ones).	2015-08-17 22:00:07 -04:00
Gregory Hainaut	bd0de8fbaf	glsl: (colclip) HDR doesn't need wrapping neither clamping Might save a couple of instruction in the PS	2015-08-08 09:18:09 +02:00
Gregory Hainaut	d822b6bd1a	gsdx-ogl: add some comments for the future	2015-08-08 09:17:06 +02:00
Gregory Hainaut	b17803bb34	gsdx-ogl: wipeout of GL_ARB_bindless_texture Code is completely broken. It doesn't help to improve speed. Remove 200 lines ;)	2015-08-08 09:16:49 +02:00
Gregory Hainaut	36554c3375	Merge branch 'hdr-colclip-32bits'	2015-08-04 21:55:40 +02:00
Gregory Hainaut	45bb27d6db	gsdx-ogl: extend HDR colclip to 32 bits texture Unfortunately 16 bits wasn't enough for Castlevania.	2015-08-04 21:54:27 +02:00
Gregory Hainaut	744f9ebc09	gsdx-ogl: rare corner case when both texture shuffle and date are enabled In texture shuffle mode the texture data is either RG or BA. It means that DATE must either checks MSB of G or A. Close #693	2015-08-04 20:10:44 +02:00
Gregory Hainaut	4a3c145c72	gsdx-ogl: depth support: better support of 16 bits z buffer Fix issue in socom2	2015-08-01 01:28:41 +02:00
Gregory Hainaut	cfd0fd6cc8	gsdx-ogl: remove old colclip algo	2015-07-31 09:45:28 +02:00
Gregory Hainaut	93c47feb7c	gsdx-ogl: replace old colclip algo with the HDR algo Similar speed but more accurate Allow to clean the code	2015-07-31 09:45:28 +02:00
Gregory Hainaut	e026f1bac6	gsdx-ogl: implement a fast accurate colclip algo The idea is to use a floating texture to accumulate the data and then do a final postprocessing pass to apply the modulo v2: * use bounding box to * fix vertex corruption issue * use negative number in shader which allow to use half float (+12 fps@4x)	2015-07-30 18:34:52 +02:00
Gregory Hainaut	d191ae4aba	glsl: use rounding in convert 4 shader It will avoid texture rouding error with negative number	2015-07-30 18:22:59 +02:00
Gregory Hainaut	ae8df002af	gsdx-ogl: optimize Cs * As + Cd and Cs * Af + Cd blending Basically the code does the alpha multiplication in the shader therefore the blend unit only does a pure addition. This way the multiplication is accurate and accurate_blending doesn't requires a costly barrier. This code also avoid variable duplication to make the code more separated. Hopefully blending can be done in a separated function It is preliminary work to support fast color clipping with HDR v2: fix assertion compilation failure v3: fix regression in not accurate mode v3: Cs * As/Af is not an accumulation Those cases don't need the Cd addition and were already optimized anyway Fix a regression on GoW2	2015-07-30 18:22:59 +02:00
Gregory Hainaut	95c374bbcc	glsl: Round 0.95+ to 1 for texel colors It improves SotC blooming. Suikoden seems to be fine too.	2015-07-30 18:21:01 +02:00
Gregory Hainaut	223b7daa1d	gsdx-ogl: fix fbmask regression I forget to remove an useless 255.0f factor	2015-07-19 23:55:27 +02:00
Gregory Hainaut	5c740ff41e	gsdx-ogl: wipeout AlphaStencil & Alpha hack Accurate options do a better jobs. Technically it can still be useful for old gpu/driver that doesn't support the GL4.5 extension. On Windows, you can still rely on Dx On linux, free driver support it (except Intel)	2015-07-19 22:43:48 +02:00
Gregory Hainaut	38c086008d	glsl: round blending as SW renderer Better tone on God of War with ultra accurate mode	2015-07-18 17:16:46 +02:00
Gregory Hainaut	c4e165067d	glsl: use signed integer when working with substraction... It mostly fixes rendering of GoW with ultra blending Color is still a bit wrong but rounding is likely incorrect	2015-07-18 14:41:03 +02:00
Gregory Hainaut	c701ab4368	glsl: don't use normalized value for color range Globally shader uses less intruction (except blending part) It would also allow to improve the rounding of color	2015-07-18 14:41:03 +02:00
Gregory Hainaut	57394a03e0	partially revert: glsl: disable computing of extra alpha coeff in SW blending It generates undefined data in screen. Maybe blending isn't switch off properly. I need to investigate it.	2015-07-18 14:41:03 +02:00
Gregory Hainaut	036cb229a3	glsl: add various comment for future idea For example GL4 GPU supports special bit operation	2015-07-18 14:40:57 +02:00
Gregory Hainaut	36e83b42ce	glsl: round texture and fragment color as the SW renderer GS uses integer value and does integer operation too. This commit trunc the sampled texture, the interpoled fragment color and the product of the 2. It impacts negatively the perf of about 3/4% (GPU) but it fixes rendering on suikoden and potentially some others games too.	2015-07-17 21:08:49 +02:00
Gregory Hainaut	5f247a6e16	glsl: rewrite tfx function to ease future update No need to put lots of ifdef, compiler will optimize everything It increases a bit the readability	2015-07-17 21:08:49 +02:00
Gregory Hainaut	ea9e608288	gsdx-ogl: fix texture palette mask	2015-07-17 21:08:49 +02:00
Gregory Hainaut	b4c04ed00a	gsdx-ogl: wipeout subroutine code Code was completey bitrotten Code was a partial test (and yet 500 lines already) Shader is more and more complex and multithreading support greatly reduce the cost of shader switch	2015-07-17 21:08:49 +02:00
Gregory Hainaut	e3751f6cd9	glsl: disable computing of extra alpha coeff in SW blending Hum, I'm curious of the impact to enable only this code when it is actually used.	2015-07-17 21:08:49 +02:00
Gregory Hainaut	784822a5c2	glsl: redo blending management to use A/B/C/D directly 1/ Code is much more readable 2/ It will allow to round differently the operation in the future	2015-07-17 21:08:49 +02:00
Gregory Hainaut	91fbe6f108	gsdx-ogl: add some code to fix black netting on some renderings Code is not yet enabled because it requires extensive test The idea is to replace point by a 1 pixels sprite with the help of a geometry shader. In 4x, point will be replaced by a 4x4 sprite.	2015-07-11 14:35:35 +02:00
Gregory Hainaut	5e7ce63ed1	glsl: add a geometry shader to upscale GS_POINT The purpose is to avoid upscaling issue on POINT rendering (black netting on some FMVs)	2015-07-11 14:35:34 +02:00
Gregory Hainaut	986754bde7	glsl: don't declare gl_FragDepth Fix compilation issue on Mesa driver (hope it doesn't break Catalyst)	2015-07-05 18:12:36 +02:00
Gregory Hainaut	a9f49ab9ab	glsl: fix rounding error for shadow computation Better shadow on Castlevania/Nemo (others :) )	2015-07-01 09:36:54 +02:00
Gregory Hainaut	6121677aa1	gsdx-tc: use a single shader pass to convert texture in 8 bits format It might save a couple of fps Add a define to test the perf if we keep only the blue channel. It brokes the code in Prince Of Persia that use the Red/Green channel... Maybe the speed hack :( Or find a way to replace all if with a lookup table Note: it is only supported on OpenGL currently	2015-07-01 09:36:54 +02:00
Gregory Hainaut	e2727f10ba	glsl: fix for AMD I hope :)	2015-07-01 09:36:54 +02:00
Gregory Hainaut	d29e375f72	gsdx-tc: GPU accelerate 8 bits texture conversion Only native is supported currently	2015-07-01 09:36:54 +02:00
Gregory Hainaut	db5468afcf	glsl: fix shadow on Shin Megami Tensei Nocturne	2015-07-01 09:36:54 +02:00
Gregory Hainaut	839003467e	gsdx-ogl: add support of partial frame buffer masking It might help to fix a bit the color on a couple of games accurate_fbmask = 1 Code uses GL4.5 extensions. So far it seems the effect is ony used a couple of time and often in non-overlapping primitive. Speed impact will likely remain small	2015-07-01 09:36:53 +02:00
Gregory Hainaut	82818dab3c	gsdx-ogl: make some room in AlphaCoefficient variable The idea will be to use the remaining int to store the FB mask	2015-07-01 09:36:53 +02:00
Gregory Hainaut	42e911c78f	gsdx-ogl: mask alpha channel in depth conversion Might not work if depth is recasted as a rt (we lost the alpha channel)	2015-07-01 09:30:20 +02:00
Gregory Hainaut	58ce7d4bb8	gsdx-ogl: emulate texture shuffle GS doesn't supports texture shuffle/swizzle so it is emulated in a complex way. The idea is to read/write the 32 bits color format as a 16 bit format. This way, RG (16 lsb bits) or BA (16 msb bits) can be read or written with square texture that targets pixels 1-8 or pixels 8-16. However shuffle is limited. For example you can copy the green channel to either the alpha channel or another green channel. Note: Partial masking of channel is not yet implemented V2: improve logging V3: better support of green channel in shader V4: improve detection of destination (issue due to rounding)	2015-07-01 09:30:20 +02:00
Gregory Hainaut	8393ba56d6	gsdx-ogl: rework palette texture handling Redirect the red channel to alpha channel for 8 bits texture. It avoid special management in the shader	2015-06-24 19:50:09 +02:00
Gregory Hainaut	23f8203a22	gsdx-ogl: add some shaders to convert depth <-> color I'm afraid of rounding error	2015-06-07 12:39:00 +02:00
Gregory Hainaut	419dfe0544	glsl: redo color/alpha management correction Please test it! GS supports 3 formats for the output: 32 bits: normal case => no change 24 bits: like 32 bits but without alpha channel => mask alpha channel (ie don't write it anymore) => Always uses 1.0f as blending coefficient 16 bits: RGB5A1, emulated by a 32 bits openGL texture. I think it will be more correct to use a real 16 bits GL texture. Unfortunately it would cost several (slow) target conversions. Anyway as a current solution => apply a mask of 0xF8 on color when SW blending is used (improve Castlevania shadow) unfortunately normal blending mode still uses the full range of colors! This commit also corrects a couple of blending factor. 128/255 is equivalent to 1.0f in PS2, whereas GPU uses 1.0f. So the blending factor must be 255/128 instead of 2 Note: disable CRC hack and enable accurate_colclip to see Castlevania shadow ^^ (issue #380). Note2: SW renderer is darker on Castlevania. I don't know why maybe linked to the 16 bits format poorly emulated	2015-05-26 16:49:43 +02:00
Gregory Hainaut	b0af54d33e	gsdx-ogl: better support of palette The purpose of the code is to support alpha channel of RT uses as an index for a palette texure. I'm afraid that code will likely break pure palette texture. Only used if paltex is enabled It fixes missing shadow in Star Ocean 3 (issue #374) in Native resolution with filter = 0 (no filtering) or = 2 (normal fitering) Rendering explanation: The game emulates a stencil buffer with the alpha channel The alpha channel of the RT can contains a palette texture index (format 4HH) The idea is to have a gradient of value in the palette (16/32/48/...). This way you can implement a +16/-16 and even wrap the alpha value every time you hit the pixel. Bilinear filtering breaks the rendering because it interpolates between counts so you doesn't have the exact count Upscaling breaks the rendering because the RT is reused as an input texture. It means that we need to scale it down which again create some interpolations.	2015-05-24 18:07:16 +02:00
Gregory Hainaut	a70c3bf5de	glsl: correct the alternate implementation of ps_main1 Still not yet enabled by default Potentially it can be optimized with the dot product but special care need to be taken to ensure float accuracy. Bonus: it could work on old GPU (aka DX9)	2015-05-24 13:43:36 +02:00
Gregory Hainaut	3d35e4f8ec	gsdx-ogl: implement all blends mode in shader + 250 lines !!!	2015-05-20 09:11:36 +02:00
Gregory Hainaut	8d3e3e6c5b	gsdx-ogl: more blend rework to support accurate_colclip So far few blending equations are implemented in PS. It is only for test the behavior on GoW	2015-05-20 08:00:40 +02:00
Gregory Hainaut	c5341a2711	gsdx-ogl: update blending management This way it will allow to implement all blendings operartion in FS. Of course it will be slow, but it would be nice for debug and quickly check game error rendering.	2015-05-20 00:12:52 +02:00
Gregory Hainaut	358e0d445b	gsdx-ogl: add a shader to convert depth texture into uint	2015-05-19 18:56:32 +02:00
Gregory Hainaut	ec45090358	glsl: implement an alternative of convert 1 shader Code not yet enabled. Quick test show no heavy regression but I need to carefully validate it.	2015-05-19 13:14:18 +02:00
Gregory Hainaut	51ed3e63e9	glsl: oups forget to remove a line	2015-05-10 10:39:23 +02:00
Gregory Hainaut	5565544ba6	gsdx-ogl: DATE with texture barrier Much faster for small batch that write the alpha value. Code can be enabled with accurate_date option. Here a summary of all DATE possibilities: 1/ no overlap of primitive => texture barrier (pro no setup of stencil and single draw) 2/ alpha written => small batch => texture barrier (primitive by primitive). Done in N-primitive draw calls. (based on GL_ARB_texture_barrier) => bigger batch => compute the first good primitive, slow but only 2 draw calls. (based on GL_ARB_shader_image_load_store) => Otherwise there is the UserHacks_AlphaStencil but it is a hack! 3/ alpha written => full setup of stencil ( 2 draw calls)	2015-05-09 19:54:01 +02:00
Gregory Hainaut	7fac25edb0	glsl: add shader implementation of impossible blend Issue: I don't know if destination alpha value must be multiplied by 2.0f	2015-05-09 15:02:34 +02:00
Gregory Hainaut	f029e4763f	gsdx-ogl: add the alpha value in a constant buffer	2015-05-09 15:02:34 +02:00
Gregory Hainaut	9023f5f0db	glsl: drop useless ifndef I often mix them with the real value... .	2015-05-08 19:28:17 +02:00
Gregory Hainaut	1addae1993	gsdx-ogl: don't use extra shader for sprite hack Atst == 2 && sprite_hack is equivalent to Atst == 1 It frees a bit in the shader selector, and reduces shader combinations.	2015-05-08 19:28:16 +02:00
Gregory Hainaut	530e4ce776	gsdx-ogl: drop hack that rescale primitive (to avoid upscale glitch) Rendering is bad. It renders sprites at native resolution.	2015-05-06 19:09:13 +02:00
Gregory Hainaut	ae70344fbc	gsdx-ogl: fix 2 passes DATE with GL42 (override_GL_ARB_shader_image_load_store = 1) It fixes shadows issue with Persona 3	2015-05-02 16:56:18 +02:00
Gregory Hainaut	73d04e33e9	gsdx ogl: clean various comment and old code	2015-05-01 20:04:23 +02:00
Gregory Hainaut	335695bd0e	purge GLES from GSdx ! mobile will use vulkan (or any new API) anyway	2015-05-01 20:02:17 +02:00
Gregory Hainaut	71e517108d	gsdx-ogl: fix colclip Sotc shadow and likely various games :) Thanks to pseudo	2015-04-30 23:08:05 +02:00
Gregory Hainaut	b65a62096f	gsdx-ogl: drop support of ENABLE_OGL_DEBUG Stencil can be read by GL debugger due to correct mask configuration	2015-04-30 20:02:51 +02:00
Gregory Hainaut	eaa0dae305	glsl: drop invariant qualifier Fix issue #510 and potentially it allows the compiler to better optimize the code	2015-04-27 18:52:55 +02:00
Gregory Hainaut	46ff4dc3d3	gsdx-ogl: hardware unit only support normalization of 4 bytes... (At least on recent AMD GPU)	2015-04-27 18:51:59 +02:00
Gregory Hainaut	301fe869c7	glsl: improve the logz handling This way only 0 and 1 will conflict	2015-04-26 17:51:33 +02:00
Gregory Hainaut	6a3250cad2	gsdx-ogl: fix logz management (FFXII) The issue is that z+1 overflow if z is UINT max log2(z) overflow if z is 0 So let's use an or 1 to avoid both overflow	2015-04-25 20:23:26 +02:00
Gregory Hainaut	15ae9996bb	glsl: format white space + comment	2015-04-25 12:50:12 +02:00
Gregory Hainaut	36514bd95f	glsl: fog is a single byte Give a chance to the driver to optimize if possible	2015-04-24 21:37:37 +02:00
Gregory Hainaut	c207632e49	gsdx-ogl: improve date performance for GL45 If there is no overlap, it is allowed to directly read from the render target. On SotC testcase with 6x scaling: 30fps -> 40fps Note: it requires GL_ARB_texture_barrier extension so be sure to have a recent driver Note2: it requires a lots of testing too Open question: in case of complex date (written alpha) Will it be faster to split the draw call into multiple call with no primitive overlap	2015-04-24 21:12:33 +02:00
Gregory Hainaut	795ae50ecd	gsdx-ogl: fix the recently broken advance date feature Now it is really working with a 2 stages shaders but it is still slow.	2015-04-24 20:13:38 +02:00
Gregory Hainaut	89d5e5637c	glsl: use an explicit cast insead of notEqual function If the compiler didn't optimize the code, it will be a bit faster	2015-04-24 18:01:25 +02:00
Gregory Hainaut	56836561f4	glsl: replace runtime condition by preprocessor condition It might be an easier work for the compiler I didn't replace all occurences to keep readability	2015-04-24 17:51:29 +02:00
Gregory Hainaut	62489f42f1	gsdx-ogl: add an optimization note for later Only 1 byte of fog is useful	2015-04-20 07:18:09 +02:00
Gregory Hainaut	6d253c0b8f	glsl: fix debugging of tex coordinate in apitrace	2015-04-20 07:18:08 +02:00
Gregory Hainaut	31f8c065db	gsdx-ogl: implement a new hack UserHacks_UnscaleSprite for opengl UserHacks_UnscaleSprite = 1 will unscale flat sprites UserHacks_UnscaleSprite = 2 will unscale all sprites (don't work well so far) The idea of the hack is to redo the interpolation of texture coordinate based on the non-upscaled pixel position. It avoids various glitches but sprites aren't upscaled anymore (so no more anti-aliasing, potentially a coefficient can be added).	2015-04-20 07:18:08 +02:00
Gregory Hainaut	15264c6c63	glsl: split the main shader * separate VS/GS and FS * separate subroutine part of the FS It already complex enough without subroutine stuff. Besides I'm not sure we will keep subroutine on the future.	2015-04-19 18:49:02 +02:00
Gregory Hainaut	418f2e69a8	gsdx-ogl: implement the wildhack on the GPU Likely much faster for opengl and much easier to implement Note: hopefully UserHacks_round_sprite_offset will replace it	2015-04-13 22:14:36 +02:00
Gregory Hainaut	05eb1641f2	gsdx: add glsl debug capabilities Useful to debug bad texture sampling	2015-04-08 10:15:25 +02:00
Gregory Hainaut	e40a26be19	gsdx: add some helper to debug openGL renderer	2015-04-03 20:28:44 +02:00
Gregory Hainaut	29a14f5667	sed -i -e 's/seperately/separately/' Thanks micove again	2014-12-07 12:46:16 +01:00
Gregory Hainaut	16284d0a59	gsdx: fix a couple of typo From @micove	2014-12-07 12:12:53 +01:00
Gregory Hainaut	d21e6ff45f	gsdx-ogl: don't declare a sampler variable Nvidia was complaining on the fxaa shader. In doubt I also replaced it in Asmodean's shader	2014-11-20 22:07:44 +01:00
Gregory Hainaut	8c90e7cafc	gsdx-ogl: support latest fxaa version Only tested on Nvidia, please report any issue with your driver Note: requires GL4 GPU	2014-11-10 10:39:55 +01:00
Gregory Hainaut	84f844767c	gsdx-ogl: micro optimize PSConstantBuffer cache Might help to save a cache line on the CPU :)	2014-11-08 21:39:17 +01:00
Gregory Hainaut	ccc1137e12	gsdx-ogl: merge the two vertex buffer format * Only a single VAO => Format is set once => Only a single bind at startup => GSVertexBufferStateOGL is nearly useless => barely faster but better than nothing :)	2014-10-02 20:44:22 +02:00
Gregory Hainaut	d37cc8e1e7	gsdx-ogl: GL_ARB_shading_language_420pack is now mandatory All API/drivers support it so time to remove the fallback.	2014-10-02 20:44:22 +02:00
Gregory Hainaut	594f6c33a2	gsdx-ogl-ES: require GL_EXT_shader_io_blocks + GLES3.1 Allow to use same shader interface for all API Note: on the GL API it will require GL3.3 (see next commit)	2014-09-30 22:18:01 +02:00
Gregory Hainaut	4659184cc1	gsdx-ogl: add support of clip_control (depth only) * replace the [-1;1] depth range of openGL with the DX range [0;1].	2014-09-28 12:00:34 +02:00
Gregory Hainaut	0d45e6d70e	gsdx-ogl: avoid to send constant to the GPU It was a waste of bandwith	2014-04-06 10:44:40 +02:00
gregory.hainaut	384c0c12ea	gsdx ogl: * properly detect gl nv depth extension * Always show the hack on the gui. Add a new hack option for DATE (gl4.2) only * Save the scan mode on linux too (f7) * hopefully fix some crash on some drivers... (ensure aligment 256 bits alignment, and if not use std memcpy) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5888 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-07 19:53:01 +00:00
gregory.hainaut	6f3326a9a4	gsdx ogl: Fog hide this typo :p git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5861 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-01 10:14:01 +00:00
gregory.hainaut	84895eadd9	gsdx ogl: correct most of Z-depth issue Best setting if you driver support GL_NV_depth_buffer_float => GL_NV_Depth = 1 & logz = 0 Otherwise => GL_NV_Depth = 0 & logz = 1 Explanation of the bug: Dx z position ranges from 0.0f to 1.0f (FS ranges 0.0f to 1.0f) GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f) Why it sucks: GS small depth value will be "mapped" to -1.0f. In others all small values will be 1.0f! Terrible lost of accuraccy. The GL_NV_depth_buffer_float extension allow to set the near plane as -1.0f. So "GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f)" will become "GL z Position ranges from -1.0f to 1.0f (FS ranges -1.0f to 1.0f)" and therefore "z posision [0.0f;1.0f] will map to FS [0.0f;1.0f]" as DX Yes we just get back all precision lost previously :) However you need hardware (intel?) and driver support (free driver?/gles?) :( git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5860 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-31 21:30:54 +00:00
gregory.hainaut	c2aa4ff3fd	gsdx ogl: * restore the old fxaa (Asmodeam will be integrated when I got time) * port the recently added new scanline algo git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5818 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-18 14:46:13 +00:00
gregory.hainaut	23ae6f0641	the regular copy/past mistake git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5811 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 12:35:15 +00:00
gregory.hainaut	2238095a82	gsdx ogl: do the same as previous commit but for ogl ;) * fix the missing auto interlace opt when cycling with hotkey on linux git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5810 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 11:38:50 +00:00
gregory.hainaut	f5257b2c3c	gsdx ogl: don't rely on GL_ES builtin (3 differents implementations...). Hopefully fix Linux Intel GPU. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5808 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-11 15:01:13 +00:00
gregory.hainaut	e80b002929	gsdx ogl: Flush various pending work * try to use more subroutine on VS&PS, unfortunately hit a driver crash! * Call Attach/DetachContext through GSDevice so I can unmap currently mapped buffer * Implement glsl part of GL_ARB_bindless texture, again hit another driver crash! * various fix of GL_ARB_buffer_storage. Basic benchmark show only improvement on 'cold' case, I guess it will improve smoothness * try to fix GL_clear_texture, no success so far. It seem the extension is limited to basic texture (aka no depth/stencil) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5752 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-10-24 20:54:27 +00:00
gregory.hainaut	e01c6cd9ce	gsdx ogl: the proof of concept commit * GL_ARB_shader_subroutine for perf fix for nvidia => add missing shader declaration. Nvidia got +4fps on colin3 :) For the moment only 2 PS parameters are supported. Code need to be extended to support others games that often switch shader program (like xenosaga). require GL4 class hardware and the option override_GL_ARB_shader_subroutine = 1 Note: strangely on AMD linux it is slower! * GL_ARB_shader_image_load_store for accuraccy (Date) Use a signed integer texture and reenable color buffer writing Current status: Amagami_transparency.gs & P3_battle_shadows.gs are now working on Nvidia with a small perf impact. Current implementation detail: 1/ setup the standard stencil as before 2/ on remaining pixel, draw once to compute first primitive that will write a fail alpha value. 3/ final draw based on primitive id of step 2 Note: I think we would get a bad behavior if depth test&mask are enabled on step 2/3 Note2: on my limited testcase the perf impact was on CPU. It would be possible to merge step1&2 to nullifying it (could even be faster actually), however it would require more GPU power. Again require GL4 class hardware. And the option UserHacks_DateGL4 = 1 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5725 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-28 08:44:16 +00:00
gregory.hainaut	07605941ef	gsdx ogl: * some preliminary work to test/benchmark bindless texture in the future (glsl was not yet updated) Bindless texture allow to get a GPU texture pointer and then set it directly to the shader as a basic uniform. => no more texture unit selection/validation => no more texture validation neither texture hash lookup 3rdparty: update gl header to the latest gl4.4 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5720 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-17 08:57:52 +00:00

1 2 3 4

158 Commits