pcsx2

Commit Graph

Author	SHA1	Message	Date
Gregory Hainaut	b17803bb34	gsdx-ogl: wipeout of GL_ARB_bindless_texture Code is completely broken. It doesn't help to improve speed. Remove 200 lines ;)	2015-08-08 09:16:49 +02:00
Gregory Hainaut	36554c3375	Merge branch 'hdr-colclip-32bits'	2015-08-04 21:55:40 +02:00
Gregory Hainaut	45bb27d6db	gsdx-ogl: extend HDR colclip to 32 bits texture Unfortunately 16 bits wasn't enough for Castlevania.	2015-08-04 21:54:27 +02:00
Gregory Hainaut	744f9ebc09	gsdx-ogl: rare corner case when both texture shuffle and date are enabled In texture shuffle mode the texture data is either RG or BA. It means that DATE must either checks MSB of G or A. Close #693	2015-08-04 20:10:44 +02:00
gabest11	542dd33169	Merge branch 'master' of https://github.com/pcsx2/pcsx2	2015-08-02 20:27:33 +02:00
gabest11	a1a842b07f	gsdx: date/datm fix for 16-bit frame buffer in sw rendering mode	2015-08-02 20:21:49 +02:00
Gregory Hainaut	4a3c145c72	gsdx-ogl: depth support: better support of 16 bits z buffer Fix issue in socom2	2015-08-01 01:28:41 +02:00
Gregory Hainaut	cfd0fd6cc8	gsdx-ogl: remove old colclip algo	2015-07-31 09:45:28 +02:00
Gregory Hainaut	93c47feb7c	gsdx-ogl: replace old colclip algo with the HDR algo Similar speed but more accurate Allow to clean the code	2015-07-31 09:45:28 +02:00
Gregory Hainaut	e026f1bac6	gsdx-ogl: implement a fast accurate colclip algo The idea is to use a floating texture to accumulate the data and then do a final postprocessing pass to apply the modulo v2: * use bounding box to * fix vertex corruption issue * use negative number in shader which allow to use half float (+12 fps@4x)	2015-07-30 18:34:52 +02:00
Gregory Hainaut	d191ae4aba	glsl: use rounding in convert 4 shader It will avoid texture rouding error with negative number	2015-07-30 18:22:59 +02:00
Gregory Hainaut	ae8df002af	gsdx-ogl: optimize Cs * As + Cd and Cs * Af + Cd blending Basically the code does the alpha multiplication in the shader therefore the blend unit only does a pure addition. This way the multiplication is accurate and accurate_blending doesn't requires a costly barrier. This code also avoid variable duplication to make the code more separated. Hopefully blending can be done in a separated function It is preliminary work to support fast color clipping with HDR v2: fix assertion compilation failure v3: fix regression in not accurate mode v3: Cs * As/Af is not an accumulation Those cases don't need the Cd addition and were already optimized anyway Fix a regression on GoW2	2015-07-30 18:22:59 +02:00
Gregory Hainaut	95c374bbcc	glsl: Round 0.95+ to 1 for texel colors It improves SotC blooming. Suikoden seems to be fine too.	2015-07-30 18:21:01 +02:00
Miguel A. Colón Vélez	e4ab71abf3	Add authors comment that suggest public-domain license. Sourceforge was dead for more than a week therefore add the license information. I could not find the original TGM source (dead link) so I'm not even sure if this still applies or if the glsl was totally rewritten. None of the glsl files have a copyright header so it's hard to tell.	2015-07-27 14:21:29 -04:00
Gregory Hainaut	223b7daa1d	gsdx-ogl: fix fbmask regression I forget to remove an useless 255.0f factor	2015-07-19 23:55:27 +02:00
Gregory Hainaut	5c740ff41e	gsdx-ogl: wipeout AlphaStencil & Alpha hack Accurate options do a better jobs. Technically it can still be useful for old gpu/driver that doesn't support the GL4.5 extension. On Windows, you can still rely on Dx On linux, free driver support it (except Intel)	2015-07-19 22:43:48 +02:00
Gregory Hainaut	38c086008d	glsl: round blending as SW renderer Better tone on God of War with ultra accurate mode	2015-07-18 17:16:46 +02:00
Gregory Hainaut	c4e165067d	glsl: use signed integer when working with substraction... It mostly fixes rendering of GoW with ultra blending Color is still a bit wrong but rounding is likely incorrect	2015-07-18 14:41:03 +02:00
Gregory Hainaut	c701ab4368	glsl: don't use normalized value for color range Globally shader uses less intruction (except blending part) It would also allow to improve the rounding of color	2015-07-18 14:41:03 +02:00
Gregory Hainaut	57394a03e0	partially revert: glsl: disable computing of extra alpha coeff in SW blending It generates undefined data in screen. Maybe blending isn't switch off properly. I need to investigate it.	2015-07-18 14:41:03 +02:00
Gregory Hainaut	036cb229a3	glsl: add various comment for future idea For example GL4 GPU supports special bit operation	2015-07-18 14:40:57 +02:00
Gregory Hainaut	36e83b42ce	glsl: round texture and fragment color as the SW renderer GS uses integer value and does integer operation too. This commit trunc the sampled texture, the interpoled fragment color and the product of the 2. It impacts negatively the perf of about 3/4% (GPU) but it fixes rendering on suikoden and potentially some others games too.	2015-07-17 21:08:49 +02:00
Gregory Hainaut	5f247a6e16	glsl: rewrite tfx function to ease future update No need to put lots of ifdef, compiler will optimize everything It increases a bit the readability	2015-07-17 21:08:49 +02:00
Gregory Hainaut	ea9e608288	gsdx-ogl: fix texture palette mask	2015-07-17 21:08:49 +02:00
Gregory Hainaut	b4c04ed00a	gsdx-ogl: wipeout subroutine code Code was completey bitrotten Code was a partial test (and yet 500 lines already) Shader is more and more complex and multithreading support greatly reduce the cost of shader switch	2015-07-17 21:08:49 +02:00
Gregory Hainaut	e3751f6cd9	glsl: disable computing of extra alpha coeff in SW blending Hum, I'm curious of the impact to enable only this code when it is actually used.	2015-07-17 21:08:49 +02:00
Gregory Hainaut	784822a5c2	glsl: redo blending management to use A/B/C/D directly 1/ Code is much more readable 2/ It will allow to round differently the operation in the future	2015-07-17 21:08:49 +02:00
Gregory Hainaut	91fbe6f108	gsdx-ogl: add some code to fix black netting on some renderings Code is not yet enabled because it requires extensive test The idea is to replace point by a 1 pixels sprite with the help of a geometry shader. In 4x, point will be replaced by a 4x4 sprite.	2015-07-11 14:35:35 +02:00
Gregory Hainaut	5e7ce63ed1	glsl: add a geometry shader to upscale GS_POINT The purpose is to avoid upscaling issue on POINT rendering (black netting on some FMVs)	2015-07-11 14:35:34 +02:00
Gregory Hainaut	986754bde7	glsl: don't declare gl_FragDepth Fix compilation issue on Mesa driver (hope it doesn't break Catalyst)	2015-07-05 18:12:36 +02:00
Gregory Hainaut	a9f49ab9ab	glsl: fix rounding error for shadow computation Better shadow on Castlevania/Nemo (others :) )	2015-07-01 09:36:54 +02:00
Gregory Hainaut	6121677aa1	gsdx-tc: use a single shader pass to convert texture in 8 bits format It might save a couple of fps Add a define to test the perf if we keep only the blue channel. It brokes the code in Prince Of Persia that use the Red/Green channel... Maybe the speed hack :( Or find a way to replace all if with a lookup table Note: it is only supported on OpenGL currently	2015-07-01 09:36:54 +02:00
Gregory Hainaut	e2727f10ba	glsl: fix for AMD I hope :)	2015-07-01 09:36:54 +02:00
Gregory Hainaut	d29e375f72	gsdx-tc: GPU accelerate 8 bits texture conversion Only native is supported currently	2015-07-01 09:36:54 +02:00
Gregory Hainaut	db5468afcf	glsl: fix shadow on Shin Megami Tensei Nocturne	2015-07-01 09:36:54 +02:00
Gregory Hainaut	839003467e	gsdx-ogl: add support of partial frame buffer masking It might help to fix a bit the color on a couple of games accurate_fbmask = 1 Code uses GL4.5 extensions. So far it seems the effect is ony used a couple of time and often in non-overlapping primitive. Speed impact will likely remain small	2015-07-01 09:36:53 +02:00
Gregory Hainaut	82818dab3c	gsdx-ogl: make some room in AlphaCoefficient variable The idea will be to use the remaining int to store the FB mask	2015-07-01 09:36:53 +02:00
Gregory Hainaut	42e911c78f	gsdx-ogl: mask alpha channel in depth conversion Might not work if depth is recasted as a rt (we lost the alpha channel)	2015-07-01 09:30:20 +02:00
$refractionpcsx2$ refractionpcsx2	af09d7e063	GSdx-DX: Modified shader slightly as some bits were incorrect.	2015-07-01 09:30:20 +02:00
$refractionpcsx2$ refractionpcsx2	4bc8bfc23e	GSdx-DX: Rough port of texture switching from OGL	2015-07-01 09:30:20 +02:00
Gregory Hainaut	58ce7d4bb8	gsdx-ogl: emulate texture shuffle GS doesn't supports texture shuffle/swizzle so it is emulated in a complex way. The idea is to read/write the 32 bits color format as a 16 bit format. This way, RG (16 lsb bits) or BA (16 msb bits) can be read or written with square texture that targets pixels 1-8 or pixels 8-16. However shuffle is limited. For example you can copy the green channel to either the alpha channel or another green channel. Note: Partial masking of channel is not yet implemented V2: improve logging V3: better support of green channel in shader V4: improve detection of destination (issue due to rounding)	2015-07-01 09:30:20 +02:00
Gregory Hainaut	8393ba56d6	gsdx-ogl: rework palette texture handling Redirect the red channel to alpha channel for 8 bits texture. It avoid special management in the shader	2015-06-24 19:50:09 +02:00
Gregory Hainaut	23f8203a22	gsdx-ogl: add some shaders to convert depth <-> color I'm afraid of rounding error	2015-06-07 12:39:00 +02:00
Gregory Hainaut	2cbde89084	Merge pull request #555 from PCSX2/real-fb-format GSdx: better framebuffer format	2015-06-01 11:48:07 +02:00
Gregory Hainaut	da0b77e09b	gsdx-ogl: new logo Thanks bmate for the logo. Close issue #192	2015-05-29 11:05:52 +02:00
Gregory Hainaut	419dfe0544	glsl: redo color/alpha management correction Please test it! GS supports 3 formats for the output: 32 bits: normal case => no change 24 bits: like 32 bits but without alpha channel => mask alpha channel (ie don't write it anymore) => Always uses 1.0f as blending coefficient 16 bits: RGB5A1, emulated by a 32 bits openGL texture. I think it will be more correct to use a real 16 bits GL texture. Unfortunately it would cost several (slow) target conversions. Anyway as a current solution => apply a mask of 0xF8 on color when SW blending is used (improve Castlevania shadow) unfortunately normal blending mode still uses the full range of colors! This commit also corrects a couple of blending factor. 128/255 is equivalent to 1.0f in PS2, whereas GPU uses 1.0f. So the blending factor must be 255/128 instead of 2 Note: disable CRC hack and enable accurate_colclip to see Castlevania shadow ^^ (issue #380). Note2: SW renderer is darker on Castlevania. I don't know why maybe linked to the 16 bits format poorly emulated	2015-05-26 16:49:43 +02:00
Gregory Hainaut	b0af54d33e	gsdx-ogl: better support of palette The purpose of the code is to support alpha channel of RT uses as an index for a palette texure. I'm afraid that code will likely break pure palette texture. Only used if paltex is enabled It fixes missing shadow in Star Ocean 3 (issue #374) in Native resolution with filter = 0 (no filtering) or = 2 (normal fitering) Rendering explanation: The game emulates a stencil buffer with the alpha channel The alpha channel of the RT can contains a palette texture index (format 4HH) The idea is to have a gradient of value in the palette (16/32/48/...). This way you can implement a +16/-16 and even wrap the alpha value every time you hit the pixel. Bilinear filtering breaks the rendering because it interpolates between counts so you doesn't have the exact count Upscaling breaks the rendering because the RT is reused as an input texture. It means that we need to scale it down which again create some interpolations.	2015-05-24 18:07:16 +02:00
Gregory Hainaut	a70c3bf5de	glsl: correct the alternate implementation of ps_main1 Still not yet enabled by default Potentially it can be optimized with the dot product but special care need to be taken to ensure float accuracy. Bonus: it could work on old GPU (aka DX9)	2015-05-24 13:43:36 +02:00
Gregory Hainaut	3d35e4f8ec	gsdx-ogl: implement all blends mode in shader + 250 lines !!!	2015-05-20 09:11:36 +02:00
Gregory Hainaut	8d3e3e6c5b	gsdx-ogl: more blend rework to support accurate_colclip So far few blending equations are implemented in PS. It is only for test the behavior on GoW	2015-05-20 08:00:40 +02:00
Gregory Hainaut	c5341a2711	gsdx-ogl: update blending management This way it will allow to implement all blendings operartion in FS. Of course it will be slow, but it would be nice for debug and quickly check game error rendering.	2015-05-20 00:12:52 +02:00
Gregory Hainaut	358e0d445b	gsdx-ogl: add a shader to convert depth texture into uint	2015-05-19 18:56:32 +02:00
Gregory Hainaut	ec45090358	glsl: implement an alternative of convert 1 shader Code not yet enabled. Quick test show no heavy regression but I need to carefully validate it.	2015-05-19 13:14:18 +02:00
Gregory Hainaut	51ed3e63e9	glsl: oups forget to remove a line	2015-05-10 10:39:23 +02:00
Gregory Hainaut	5565544ba6	gsdx-ogl: DATE with texture barrier Much faster for small batch that write the alpha value. Code can be enabled with accurate_date option. Here a summary of all DATE possibilities: 1/ no overlap of primitive => texture barrier (pro no setup of stencil and single draw) 2/ alpha written => small batch => texture barrier (primitive by primitive). Done in N-primitive draw calls. (based on GL_ARB_texture_barrier) => bigger batch => compute the first good primitive, slow but only 2 draw calls. (based on GL_ARB_shader_image_load_store) => Otherwise there is the UserHacks_AlphaStencil but it is a hack! 3/ alpha written => full setup of stencil ( 2 draw calls)	2015-05-09 19:54:01 +02:00
Gregory Hainaut	7fac25edb0	glsl: add shader implementation of impossible blend Issue: I don't know if destination alpha value must be multiplied by 2.0f	2015-05-09 15:02:34 +02:00
Gregory Hainaut	f029e4763f	gsdx-ogl: add the alpha value in a constant buffer	2015-05-09 15:02:34 +02:00
Gregory Hainaut	9023f5f0db	glsl: drop useless ifndef I often mix them with the real value... .	2015-05-08 19:28:17 +02:00
Gregory Hainaut	1addae1993	gsdx-ogl: don't use extra shader for sprite hack Atst == 2 && sprite_hack is equivalent to Atst == 1 It frees a bit in the shader selector, and reduces shader combinations.	2015-05-08 19:28:16 +02:00
Gregory Hainaut	530e4ce776	gsdx-ogl: drop hack that rescale primitive (to avoid upscale glitch) Rendering is bad. It renders sprites at native resolution.	2015-05-06 19:09:13 +02:00
Gregory Hainaut	ae70344fbc	gsdx-ogl: fix 2 passes DATE with GL42 (override_GL_ARB_shader_image_load_store = 1) It fixes shadows issue with Persona 3	2015-05-02 16:56:18 +02:00
Gregory Hainaut	73d04e33e9	gsdx ogl: clean various comment and old code	2015-05-01 20:04:23 +02:00
Gregory Hainaut	335695bd0e	purge GLES from GSdx ! mobile will use vulkan (or any new API) anyway	2015-05-01 20:02:17 +02:00
Gregory Hainaut	71e517108d	gsdx-ogl: fix colclip Sotc shadow and likely various games :) Thanks to pseudo	2015-04-30 23:08:05 +02:00
Gregory Hainaut	b65a62096f	gsdx-ogl: drop support of ENABLE_OGL_DEBUG Stencil can be read by GL debugger due to correct mask configuration	2015-04-30 20:02:51 +02:00
Gregory Hainaut	eaa0dae305	glsl: drop invariant qualifier Fix issue #510 and potentially it allows the compiler to better optimize the code	2015-04-27 18:52:55 +02:00
Gregory Hainaut	46ff4dc3d3	gsdx-ogl: hardware unit only support normalization of 4 bytes... (At least on recent AMD GPU)	2015-04-27 18:51:59 +02:00
Gregory Hainaut	301fe869c7	glsl: improve the logz handling This way only 0 and 1 will conflict	2015-04-26 17:51:33 +02:00
Gregory Hainaut	6a3250cad2	gsdx-ogl: fix logz management (FFXII) The issue is that z+1 overflow if z is UINT max log2(z) overflow if z is 0 So let's use an or 1 to avoid both overflow	2015-04-25 20:23:26 +02:00
Gregory Hainaut	15ae9996bb	glsl: format white space + comment	2015-04-25 12:50:12 +02:00
Gregory Hainaut	36514bd95f	glsl: fog is a single byte Give a chance to the driver to optimize if possible	2015-04-24 21:37:37 +02:00
Gregory Hainaut	c207632e49	gsdx-ogl: improve date performance for GL45 If there is no overlap, it is allowed to directly read from the render target. On SotC testcase with 6x scaling: 30fps -> 40fps Note: it requires GL_ARB_texture_barrier extension so be sure to have a recent driver Note2: it requires a lots of testing too Open question: in case of complex date (written alpha) Will it be faster to split the draw call into multiple call with no primitive overlap	2015-04-24 21:12:33 +02:00
Gregory Hainaut	795ae50ecd	gsdx-ogl: fix the recently broken advance date feature Now it is really working with a 2 stages shaders but it is still slow.	2015-04-24 20:13:38 +02:00
Gregory Hainaut	89d5e5637c	glsl: use an explicit cast insead of notEqual function If the compiler didn't optimize the code, it will be a bit faster	2015-04-24 18:01:25 +02:00
Gregory Hainaut	56836561f4	glsl: replace runtime condition by preprocessor condition It might be an easier work for the compiler I didn't replace all occurences to keep readability	2015-04-24 17:51:29 +02:00
Gregory Hainaut	62489f42f1	gsdx-ogl: add an optimization note for later Only 1 byte of fog is useful	2015-04-20 07:18:09 +02:00
Gregory Hainaut	6d253c0b8f	glsl: fix debugging of tex coordinate in apitrace	2015-04-20 07:18:08 +02:00
Gregory Hainaut	31f8c065db	gsdx-ogl: implement a new hack UserHacks_UnscaleSprite for opengl UserHacks_UnscaleSprite = 1 will unscale flat sprites UserHacks_UnscaleSprite = 2 will unscale all sprites (don't work well so far) The idea of the hack is to redo the interpolation of texture coordinate based on the non-upscaled pixel position. It avoids various glitches but sprites aren't upscaled anymore (so no more anti-aliasing, potentially a coefficient can be added).	2015-04-20 07:18:08 +02:00
Gregory Hainaut	16e6d0d305	glsl: move shader into a separate directory Only keep glsl_source.h for clarity	2015-04-19 18:49:02 +02:00
Gregory Hainaut	55fdf26898	glsl: remove the older file tfx.glsl	2015-04-19 18:49:02 +02:00
Gregory Hainaut	15264c6c63	glsl: split the main shader * separate VS/GS and FS * separate subroutine part of the FS It already complex enough without subroutine stuff. Besides I'm not sure we will keep subroutine on the future.	2015-04-19 18:49:02 +02:00
Gregory Hainaut	418f2e69a8	gsdx-ogl: implement the wildhack on the GPU Likely much faster for opengl and much easier to implement Note: hopefully UserHacks_round_sprite_offset will replace it	2015-04-13 22:14:36 +02:00
Gregory Hainaut	05eb1641f2	gsdx: add glsl debug capabilities Useful to debug bad texture sampling	2015-04-08 10:15:25 +02:00
Gregory Hainaut	e40a26be19	gsdx: add some helper to debug openGL renderer	2015-04-03 20:28:44 +02:00
Gregory Hainaut	29a14f5667	sed -i -e 's/seperately/separately/' Thanks micove again	2014-12-07 12:46:16 +01:00
Gregory Hainaut	16284d0a59	gsdx: fix a couple of typo From @micove	2014-12-07 12:12:53 +01:00
Gregory Hainaut	070dce4c83	Merge pull request #356 from PCSX2/issue-fixes various bug-tracker fixes	2014-12-06 19:48:13 +01:00
gabest11	b9b02cf749	implemented opencl program caching on disk under the system default temp folder, needs some additional work on linux	2014-12-02 00:16:35 +01:00
gabest11	e3ba15de94	opencl device selection in settings dialog	2014-12-02 00:16:34 +01:00
gabest11	6f5cd1cd4d	joined some tfx kernel calls, general speed up in most games	2014-12-02 00:16:34 +01:00
gabest11	3d2b0e3766	minor opencl kernel optimizations	2014-12-02 00:16:34 +01:00
gabest11	72cfc6a6ef	3rdparty/opencl	2014-12-02 00:16:34 +01:00
gabest11	9e20387595	Modified for opencl 1.1. While it runs on nvidia cards now, you can't use its sdk to compile gsdx, cl.hpp is missing there. Intel or amd is ok.	2014-12-02 00:16:34 +01:00
gabest11	263c097d13	solution for 32-bit z values in opencl and other minor optimizations	2014-12-02 00:16:34 +01:00
gabest11	c64f9ad9b1	squishing opencl bugs, there aren't many left hopefully	2014-12-02 00:16:34 +01:00
gabest11	ba1e522bbb	Less opencl bugs, some games are actually playable now, there are still texture errors.	2014-12-02 00:16:34 +01:00
gabest11	881735b562	minor merging errors	2014-12-02 00:16:33 +01:00
gabest11	db7c26cde7	- Experimental OpenCL renderer (missing features: point, line, texture cache, mipmap, aa1, device selection). Needs any OpenCL SDK for the common headers and stub lib to compile, tested with AMD and Intel. Too bad it is not part of the Windows SDK yet. - Renumbered renderer ids, compatible with old numbering, but it does not follow the mod3 logic anymore.	2014-12-02 00:16:33 +01:00
Gregory Hainaut	d21e6ff45f	gsdx-ogl: don't declare a sampler variable Nvidia was complaining on the fxaa shader. In doubt I also replaced it in Asmodean's shader	2014-11-20 22:07:44 +01:00
Gregory Hainaut	8c90e7cafc	gsdx-ogl: support latest fxaa version Only tested on Nvidia, please report any issue with your driver Note: requires GL4 GPU	2014-11-10 10:39:55 +01:00
Gregory Hainaut	84f844767c	gsdx-ogl: micro optimize PSConstantBuffer cache Might help to save a cache line on the CPU :)	2014-11-08 21:39:17 +01:00
Asmodean	df0a9c4683	Some small fixes, that I noticed.	2014-11-06 10:56:33 +00:00
Asmodean	937e475169	[Post-Processing] Update GSdx FX effect suite to latest version.	2014-11-03 07:15:34 +00:00
Gregory Hainaut	ccc1137e12	gsdx-ogl: merge the two vertex buffer format * Only a single VAO => Format is set once => Only a single bind at startup => GSVertexBufferStateOGL is nearly useless => barely faster but better than nothing :)	2014-10-02 20:44:22 +02:00
Gregory Hainaut	d37cc8e1e7	gsdx-ogl: GL_ARB_shading_language_420pack is now mandatory All API/drivers support it so time to remove the fallback.	2014-10-02 20:44:22 +02:00
Gregory Hainaut	594f6c33a2	gsdx-ogl-ES: require GL_EXT_shader_io_blocks + GLES3.1 Allow to use same shader interface for all API Note: on the GL API it will require GL3.3 (see next commit)	2014-09-30 22:18:01 +02:00
Gregory Hainaut	4659184cc1	gsdx-ogl: add support of clip_control (depth only) * replace the [-1;1] depth range of openGL with the DX range [0;1].	2014-09-28 12:00:34 +02:00
Gregory Hainaut	0d45e6d70e	gsdx-ogl: avoid to send constant to the GPU It was a waste of bandwith	2014-04-06 10:44:40 +02:00
Asmodean.pcsx2	0651e45460	GSdx: Just some cleaning. Reformatted fx files that were causing issues on certain text editors. They should now display correctly in those editors. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5897 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-11 04:27:32 +00:00
Asmodean.pcsx2	7d96af950e	GSdx: A few small post-processing changes, to accommodate future updates. Also removed the fallback recovery ps, and replaced the compile fail catch to a simple console print. Which I think is safer, and faster. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5894 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-09 01:01:41 +00:00
gregory.hainaut	384c0c12ea	gsdx ogl: * properly detect gl nv depth extension * Always show the hack on the gui. Add a new hack option for DATE (gl4.2) only * Save the scan mode on linux too (f7) * hopefully fix some crash on some drivers... (ensure aligment 256 bits alignment, and if not use std memcpy) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5888 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-07 19:53:01 +00:00
Asmodean.pcsx2	02afca3bb6	GSdx: Just some small post processing cleanup changes, should lower the FXAA compiling time on pause/resumes. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5872 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-02 17:50:32 +00:00
gregory.hainaut	6f3326a9a4	gsdx ogl: Fog hide this typo :p git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5861 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-01 10:14:01 +00:00
gregory.hainaut	84895eadd9	gsdx ogl: correct most of Z-depth issue Best setting if you driver support GL_NV_depth_buffer_float => GL_NV_Depth = 1 & logz = 0 Otherwise => GL_NV_Depth = 0 & logz = 1 Explanation of the bug: Dx z position ranges from 0.0f to 1.0f (FS ranges 0.0f to 1.0f) GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f) Why it sucks: GS small depth value will be "mapped" to -1.0f. In others all small values will be 1.0f! Terrible lost of accuraccy. The GL_NV_depth_buffer_float extension allow to set the near plane as -1.0f. So "GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f)" will become "GL z Position ranges from -1.0f to 1.0f (FS ranges -1.0f to 1.0f)" and therefore "z posision [0.0f;1.0f] will map to FS [0.0f;1.0f]" as DX Yes we just get back all precision lost previously :) However you need hardware (intel?) and driver support (free driver?/gles?) :( git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5860 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-31 21:30:54 +00:00
gregory.hainaut	c2aa4ff3fd	gsdx ogl: * restore the old fxaa (Asmodeam will be integrated when I got time) * port the recently added new scanline algo git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5818 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-18 14:46:13 +00:00
ramapcsx2.code	f8264bce2c	GSdx: Last patch regarding FXAA and external shader loading, I hope :) Changes are here: http://forums.pcsx2.net/Thread-Custom-Shaders-for-GSdx?pid=343285#pid343285 I've included bin\shader.fx and the settings file bin\GSdx_FX_Settings.ini. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5817 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-17 10:17:24 +00:00
ramapcsx2.code	4015db8403	GSdx: Newest scanlines shader looks much nicer this way :) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5814 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-13 18:23:53 +00:00
ramapcsx2.code	bdf5e650a6	GSdx: Quick workaround for DX9 until the shader is adapted. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5813 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-13 17:48:43 +00:00
ramapcsx2.code	1155dd6e21	GSdx: Add another scanline algorithm, made by pseudonym. This one is pretty fancy, using a cosinus function for generating the dark lines. It only works on unscaled output, so use software rendering or native resolution for hardware. It's an effect that works best on 240p converted games (like the SNES titles in Mega Man Collection). git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5812 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 19:47:49 +00:00
gregory.hainaut	23ae6f0641	the regular copy/past mistake git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5811 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 12:35:15 +00:00
gregory.hainaut	2238095a82	gsdx ogl: do the same as previous commit but for ogl ;) * fix the missing auto interlace opt when cycling with hotkey on linux git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5810 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 11:38:50 +00:00
ramapcsx2.code	2ac3c47c42	GSdx: Finish up my scanlines attempt. Now the 2 old shaders are back in and all 3 cycle on F7. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5809 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-11 23:17:07 +00:00
gregory.hainaut	f5257b2c3c	gsdx ogl: don't rely on GL_ES builtin (3 differents implementations...). Hopefully fix Linux Intel GPU. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5808 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-11 15:01:13 +00:00
sudonim1@gmail.com	a1f50d4306	GSDX: (FXAA) Treat shader model 4.1 as 4.0 and future shader models as 5 rather than failing to compile. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5803 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-09 18:32:26 +00:00
ramapcsx2.code	a1e387e23e	GSdx: Fix for the DX9 crashing issue, many thanks to Asmodean again! :) (Also sets new defaults with only minor processing, mainly FXAA now) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5798 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-06 18:03:47 +00:00
ramapcsx2.code	20ffe92ee8	GSdx: - FXAA shader replaced by Asmodeans shader suite 'PCSX2 Fx 2.00 Revised'. This is an entire enhancement suite with many configurable options like FXAA, texture sharpening or lighting tweaks. Some of the effects are pretty advanced so kudos to him allowing it to be integrated with GSdx! :) Note that there's no interface for the tweaks yet. Until those are done, the defaults will be used. Release thread here: http://forums.pcsx2.net/Thread-Custom-Shaders-for-GSdx?pid=334766#pid334766 - Disabled an option from the widescreen patches for SH:Shattered Memories. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5793 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-05 11:24:43 +00:00
ramapcsx2.code	5a53b682f0	GSdx: Added a simple scanlines filter (2nd one activated by F7). No tweaking done yet. It shows how to get scanlines that aren't affected by input resolution or scaling. A good next step would be determining the exact number of dark lines to show :) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5770 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-12-22 15:36:47 +00:00
gregory.hainaut	e80b002929	gsdx ogl: Flush various pending work * try to use more subroutine on VS&PS, unfortunately hit a driver crash! * Call Attach/DetachContext through GSDevice so I can unmap currently mapped buffer * Implement glsl part of GL_ARB_bindless texture, again hit another driver crash! * various fix of GL_ARB_buffer_storage. Basic benchmark show only improvement on 'cold' case, I guess it will improve smoothness * try to fix GL_clear_texture, no success so far. It seem the extension is limited to basic texture (aka no depth/stencil) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5752 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-10-24 20:54:27 +00:00
gregory.hainaut	e01c6cd9ce	gsdx ogl: the proof of concept commit * GL_ARB_shader_subroutine for perf fix for nvidia => add missing shader declaration. Nvidia got +4fps on colin3 :) For the moment only 2 PS parameters are supported. Code need to be extended to support others games that often switch shader program (like xenosaga). require GL4 class hardware and the option override_GL_ARB_shader_subroutine = 1 Note: strangely on AMD linux it is slower! * GL_ARB_shader_image_load_store for accuraccy (Date) Use a signed integer texture and reenable color buffer writing Current status: Amagami_transparency.gs & P3_battle_shadows.gs are now working on Nvidia with a small perf impact. Current implementation detail: 1/ setup the standard stencil as before 2/ on remaining pixel, draw once to compute first primitive that will write a fail alpha value. 3/ final draw based on primitive id of step 2 Note: I think we would get a bad behavior if depth test&mask are enabled on step 2/3 Note2: on my limited testcase the perf impact was on CPU. It would be possible to merge step1&2 to nullifying it (could even be faster actually), however it would require more GPU power. Again require GL4 class hardware. And the option UserHacks_DateGL4 = 1 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5725 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-28 08:44:16 +00:00
gregory.hainaut	07605941ef	gsdx ogl: * some preliminary work to test/benchmark bindless texture in the future (glsl was not yet updated) Bindless texture allow to get a GPU texture pointer and then set it directly to the shader as a basic uniform. => no more texture unit selection/validation => no more texture validation neither texture hash lookup 3rdparty: update gl header to the latest gl4.4 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5720 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-17 08:57:52 +00:00
gregory.hainaut	b4084047be	gsdx ogl: Used a basic flat interpolation for color interpolation (line & tri primitives) Card that support gs: remain only a gs to generate sprite from a line. Even dummy gs are costly for the GPU. Card that don't support gs: remove useless copy of color for line and triangle primitives Note for dx: opengl 3.2 (maybe not gles) supports both flat interpolation convention (GL_FIRST_VERTEX_CONVENTION or GL_LAST_VERTEX_CONVENTION). It might be possible to shuffle vertex index to put the last vertex in first position. - buff[0] = head + 0; - buff[1] = head + 1; - buff[2] = head + 2; + buff[0] = head + 2; + buff[1] = head + 1; + buff[2] = head + 0; git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5718 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-14 10:18:38 +00:00
gregory.hainaut	0f603a98d5	gsdx ogl: Test the ARB_shader_subroutine GL4.0 extension The idea was to replace shader program swith by pointer function calls inside shaders. At least parameters that are often changed between draw call. So far I only ported atst and colclip. Unfortunately code is "slower" (on GSdx standalone). For the moment keep the code but disabled. If I understand well the validation of program is done in the "driver thread" but the additional call are done in the overloaded MTGS thread. Apitrace profiling shows faster GPU draw calls. Another possibility is that the driver still need to validate the draw call because of others state change. Here some stats on colin3 (90 frames): without subroutine: UseProgram 125246 with subroutine: UseProgram 2906, subroutine 125945 => 3605 extra calls overhead (not all parameters are ported to subroutine) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5715 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-10 19:43:59 +00:00
gregory.hainaut	64f783410e	gsdx ogl: * preliminary work for GL4.4 extensions (ARB_clear_texture & ARB_multi_bind). Disabled until I got a 4.4 driver Note: I plan also to use ARB_buffer_storage * compute texture gl option in the constructor (avoid a couple of swith case) * redo texture unit management. Unit 0-2 for shaders, Unit 3 for texture operations. MultiBind will allow to bind shader input without disturbing texture binding points. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5711 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-02 16:38:12 +00:00
gregory.hainaut	e394de86dc	gsdx ogl: * add a non-working hack: UserHacks_DateGL4, goal was to replace UserHacks_AlphaStencil + Detection of good/bad samples is based on primitive ID variable. However I'm not sure the behavior is always the same between draw call...Anyway let's keep a copy of the current work * Dump integer texture into text csv * add gl4.2 ARB_shader_image_load_store extension (needed by UserHacks_DateGL4) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5707 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-28 14:40:43 +00:00
gregory.hainaut	b912778a3c	gsdx: forget to regenerate shader in previous commit git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5702 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-13 16:31:19 +00:00
gregory.hainaut	916a091f8a	gsdx ogl: GLes * use gles header file, disable opengl code (mainly GLX, ARB_sso, geometry shader) * Define properly the function pointer, GLES use basic linking whereas GL must get the symbol dynamically * cmake: properly search and set libglesv2.so * don't use dual source blending => HW renderer work (only miss unimportant FBA) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5701 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-13 11:39:45 +00:00
gregory.hainaut	bd5f379044	gsdx ogl: * uniform was wrongly set before the activation of the program (free driver only) * Always use only 1 drawbuffer. Easier besides previous setup was wrong for convert:ps_main1 GLES trial (v3): * add the cmake option GLES_API. Note library (libgles) are hardcoded for the moment * Disable opengl check * Disable gl_GetDebugMessageLogARB not supported! * Emulate gl_DrawElementsBaseVertex, add manually the index offset (surely slow but work) * Fix hundred of shader error (no implicit cast of integer to float...) Unfortunately GLES doesn't support dual blending so no blend in hardware renderer. Otherwise it is fine git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5700 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-12 21:12:34 +00:00
gregory.hainaut	a764950468	gsdx ogl: * replace vertex interface with block interface. It avoid to depends on the ARB_sso extension. * disable geometry shader on Nvidia & Linux. Slower but better than a black screen ! * default logz to 1, avoid some glitches. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5699 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-11 17:08:42 +00:00
gregory.hainaut	f22b366cea	gsdx: * redo glsl2h.pl script to generate only one big glsl headers. * fix gcc warning in GSVector.h * fix memory leak of GSDeviceOGL.m_shader * clean shader compilation function => split generation header & drop malloc stuff cmake: * only rebuild shader when asked by the use. Avoid perl dependency to build pcsx2 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5698 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-07 16:13:11 +00:00
gregory.hainaut	544c84a344	gsdx: * try to setup advance gl context attribute. If driver doesn't support it fallback to the default. zzogl: * use glsl2h to generate an header shader as GSdx. Much easier to install * Get GSUniformBufferOGL & GSVertexArrayOGL speed improvement from GSdx cmake: * detect current gcc version. Yield a warning if < 4.7 or an error if < 4.5 glsl2h: * support zzogl too * compute md5sum to avoid useless relink git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5696 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-06 09:42:46 +00:00
gregory.hainaut	d81254469a	gsdx ogl: * fix shader compilation on Nvidia (broken on r5682) * fix various memory leak thanks to Valgrind Cmake: fix a minor typo git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5688 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-30 11:18:46 +00:00
gregory.hainaut	ca1edbf2cb	gsdx ogl: * Separate state and shader compilation into separate function * replace various hash_map by basic array * Compact VertexScale and offset into a single vec4 * add the new option "ogl_vertex_subdata": subdata is faster on FGLRX, test are welcome on Nvidia drivers 0 => use map/unmap 1 => use subdata replay: add "linux_replay" option and compute some nice stat (mean, standard deviation) cmake: recreate shader header at build time git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5682 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-26 20:09:07 +00:00
gregory.hainaut	b10d554764	gsdx (ogl): * fix memory leak of m_wnd * don't escape % in the shader string * Fix shadeboost StretchRect parameter debian: compress package with xz git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5669 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-16 07:05:57 +00:00
gregory.hainaut	4c37d25134	gsdx-ogl-wnd: fix ifdef mess on fxaa (both dx and ogl) git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5665 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-15 10:35:10 +00:00
gregory.hainaut	adc232cb95	gsdx-ogl-wnd: VS2010 doesn't support string bigger than 64k bytes. So remove PS3&360 shader from fxaa git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5664 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-15 10:06:47 +00:00
gregory.hainaut	93d2cd6f7e	gsdx-ogl-wnd: ... forget to add the file... git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5661 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-15 07:22:11 +00:00
gregory.hainaut	59501227b3	gsdx-ogl-wnd: implement fxaa. I directly reuse the .fx file with minor update git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5659 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-14 21:22:44 +00:00
gregory.hainaut	92a3549e9c	gsdx-ogl-wnd: * Patch from miseru to fix VS2010 * Quick fix for 8-bits textures git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5658 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-14 16:22:22 +00:00
gregory.hainaut	e021630bb9	gsdx-ogl-wnd: * Don't write color during stencil. Keep the old method to ease debug * Realign date shader on DX * keep stencil ref to 1. Reduce stencil management burder * Fix texture pitch (was in pixels but need bytes). Fix Bleach Blade Battlers 2. Thanks Miseru for your trace Remember note: fxaa -> not yet implemented msaa -> not implemented (I might drop it actually) 8 bit texture -> not yet implemented git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5657 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-14 11:34:44 +00:00
gregory.hainaut	65b5a376ed	gsdx-ogl-wnd: * Fix shader alpha issue. Hoppefully fix lots of glitches. * Use glBufferSubData to upload data into the constant buffer instead of map/unmap. More fps :) * Fix wrong api setup on EGL * Be more portable on glsl2h git-svn-id: http://pcsx2.googlecode.com/svn/branches/gsdx-ogl-wnd@5650 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-06-07 19:16:27 +00:00

1 2 3 4 5 ...

365 Commits