pcsx2

Commit Graph

Author	SHA1	Message	Date
Gregory Hainaut	42e911c78f	gsdx-ogl: mask alpha channel in depth conversion Might not work if depth is recasted as a rt (we lost the alpha channel)	2015-07-01 09:30:20 +02:00
$refractionpcsx2$ refractionpcsx2	af09d7e063	GSdx-DX: Modified shader slightly as some bits were incorrect.	2015-07-01 09:30:20 +02:00
$refractionpcsx2$ refractionpcsx2	4bc8bfc23e	GSdx-DX: Rough port of texture switching from OGL	2015-07-01 09:30:20 +02:00
Gregory Hainaut	58ce7d4bb8	gsdx-ogl: emulate texture shuffle GS doesn't supports texture shuffle/swizzle so it is emulated in a complex way. The idea is to read/write the 32 bits color format as a 16 bit format. This way, RG (16 lsb bits) or BA (16 msb bits) can be read or written with square texture that targets pixels 1-8 or pixels 8-16. However shuffle is limited. For example you can copy the green channel to either the alpha channel or another green channel. Note: Partial masking of channel is not yet implemented V2: improve logging V3: better support of green channel in shader V4: improve detection of destination (issue due to rounding)	2015-07-01 09:30:20 +02:00
Gregory Hainaut	8393ba56d6	gsdx-ogl: rework palette texture handling Redirect the red channel to alpha channel for 8 bits texture. It avoid special management in the shader	2015-06-24 19:50:09 +02:00
Gregory Hainaut	23f8203a22	gsdx-ogl: add some shaders to convert depth <-> color I'm afraid of rounding error	2015-06-07 12:39:00 +02:00
Gregory Hainaut	2cbde89084	Merge pull request #555 from PCSX2/real-fb-format GSdx: better framebuffer format	2015-06-01 11:48:07 +02:00
Gregory Hainaut	da0b77e09b	gsdx-ogl: new logo Thanks bmate for the logo. Close issue #192	2015-05-29 11:05:52 +02:00
Gregory Hainaut	419dfe0544	glsl: redo color/alpha management correction Please test it! GS supports 3 formats for the output: 32 bits: normal case => no change 24 bits: like 32 bits but without alpha channel => mask alpha channel (ie don't write it anymore) => Always uses 1.0f as blending coefficient 16 bits: RGB5A1, emulated by a 32 bits openGL texture. I think it will be more correct to use a real 16 bits GL texture. Unfortunately it would cost several (slow) target conversions. Anyway as a current solution => apply a mask of 0xF8 on color when SW blending is used (improve Castlevania shadow) unfortunately normal blending mode still uses the full range of colors! This commit also corrects a couple of blending factor. 128/255 is equivalent to 1.0f in PS2, whereas GPU uses 1.0f. So the blending factor must be 255/128 instead of 2 Note: disable CRC hack and enable accurate_colclip to see Castlevania shadow ^^ (issue #380). Note2: SW renderer is darker on Castlevania. I don't know why maybe linked to the 16 bits format poorly emulated	2015-05-26 16:49:43 +02:00
Gregory Hainaut	b0af54d33e	gsdx-ogl: better support of palette The purpose of the code is to support alpha channel of RT uses as an index for a palette texure. I'm afraid that code will likely break pure palette texture. Only used if paltex is enabled It fixes missing shadow in Star Ocean 3 (issue #374) in Native resolution with filter = 0 (no filtering) or = 2 (normal fitering) Rendering explanation: The game emulates a stencil buffer with the alpha channel The alpha channel of the RT can contains a palette texture index (format 4HH) The idea is to have a gradient of value in the palette (16/32/48/...). This way you can implement a +16/-16 and even wrap the alpha value every time you hit the pixel. Bilinear filtering breaks the rendering because it interpolates between counts so you doesn't have the exact count Upscaling breaks the rendering because the RT is reused as an input texture. It means that we need to scale it down which again create some interpolations.	2015-05-24 18:07:16 +02:00
Gregory Hainaut	a70c3bf5de	glsl: correct the alternate implementation of ps_main1 Still not yet enabled by default Potentially it can be optimized with the dot product but special care need to be taken to ensure float accuracy. Bonus: it could work on old GPU (aka DX9)	2015-05-24 13:43:36 +02:00
Gregory Hainaut	3d35e4f8ec	gsdx-ogl: implement all blends mode in shader + 250 lines !!!	2015-05-20 09:11:36 +02:00
Gregory Hainaut	8d3e3e6c5b	gsdx-ogl: more blend rework to support accurate_colclip So far few blending equations are implemented in PS. It is only for test the behavior on GoW	2015-05-20 08:00:40 +02:00
Gregory Hainaut	c5341a2711	gsdx-ogl: update blending management This way it will allow to implement all blendings operartion in FS. Of course it will be slow, but it would be nice for debug and quickly check game error rendering.	2015-05-20 00:12:52 +02:00
Gregory Hainaut	358e0d445b	gsdx-ogl: add a shader to convert depth texture into uint	2015-05-19 18:56:32 +02:00
Gregory Hainaut	ec45090358	glsl: implement an alternative of convert 1 shader Code not yet enabled. Quick test show no heavy regression but I need to carefully validate it.	2015-05-19 13:14:18 +02:00
Gregory Hainaut	51ed3e63e9	glsl: oups forget to remove a line	2015-05-10 10:39:23 +02:00
Gregory Hainaut	5565544ba6	gsdx-ogl: DATE with texture barrier Much faster for small batch that write the alpha value. Code can be enabled with accurate_date option. Here a summary of all DATE possibilities: 1/ no overlap of primitive => texture barrier (pro no setup of stencil and single draw) 2/ alpha written => small batch => texture barrier (primitive by primitive). Done in N-primitive draw calls. (based on GL_ARB_texture_barrier) => bigger batch => compute the first good primitive, slow but only 2 draw calls. (based on GL_ARB_shader_image_load_store) => Otherwise there is the UserHacks_AlphaStencil but it is a hack! 3/ alpha written => full setup of stencil ( 2 draw calls)	2015-05-09 19:54:01 +02:00
Gregory Hainaut	7fac25edb0	glsl: add shader implementation of impossible blend Issue: I don't know if destination alpha value must be multiplied by 2.0f	2015-05-09 15:02:34 +02:00
Gregory Hainaut	f029e4763f	gsdx-ogl: add the alpha value in a constant buffer	2015-05-09 15:02:34 +02:00
Gregory Hainaut	9023f5f0db	glsl: drop useless ifndef I often mix them with the real value... .	2015-05-08 19:28:17 +02:00
Gregory Hainaut	1addae1993	gsdx-ogl: don't use extra shader for sprite hack Atst == 2 && sprite_hack is equivalent to Atst == 1 It frees a bit in the shader selector, and reduces shader combinations.	2015-05-08 19:28:16 +02:00
Gregory Hainaut	530e4ce776	gsdx-ogl: drop hack that rescale primitive (to avoid upscale glitch) Rendering is bad. It renders sprites at native resolution.	2015-05-06 19:09:13 +02:00
Gregory Hainaut	ae70344fbc	gsdx-ogl: fix 2 passes DATE with GL42 (override_GL_ARB_shader_image_load_store = 1) It fixes shadows issue with Persona 3	2015-05-02 16:56:18 +02:00
Gregory Hainaut	73d04e33e9	gsdx ogl: clean various comment and old code	2015-05-01 20:04:23 +02:00
Gregory Hainaut	335695bd0e	purge GLES from GSdx ! mobile will use vulkan (or any new API) anyway	2015-05-01 20:02:17 +02:00
Gregory Hainaut	71e517108d	gsdx-ogl: fix colclip Sotc shadow and likely various games :) Thanks to pseudo	2015-04-30 23:08:05 +02:00
Gregory Hainaut	b65a62096f	gsdx-ogl: drop support of ENABLE_OGL_DEBUG Stencil can be read by GL debugger due to correct mask configuration	2015-04-30 20:02:51 +02:00
Gregory Hainaut	eaa0dae305	glsl: drop invariant qualifier Fix issue #510 and potentially it allows the compiler to better optimize the code	2015-04-27 18:52:55 +02:00
Gregory Hainaut	46ff4dc3d3	gsdx-ogl: hardware unit only support normalization of 4 bytes... (At least on recent AMD GPU)	2015-04-27 18:51:59 +02:00
Gregory Hainaut	301fe869c7	glsl: improve the logz handling This way only 0 and 1 will conflict	2015-04-26 17:51:33 +02:00
Gregory Hainaut	6a3250cad2	gsdx-ogl: fix logz management (FFXII) The issue is that z+1 overflow if z is UINT max log2(z) overflow if z is 0 So let's use an or 1 to avoid both overflow	2015-04-25 20:23:26 +02:00
Gregory Hainaut	15ae9996bb	glsl: format white space + comment	2015-04-25 12:50:12 +02:00
Gregory Hainaut	36514bd95f	glsl: fog is a single byte Give a chance to the driver to optimize if possible	2015-04-24 21:37:37 +02:00
Gregory Hainaut	c207632e49	gsdx-ogl: improve date performance for GL45 If there is no overlap, it is allowed to directly read from the render target. On SotC testcase with 6x scaling: 30fps -> 40fps Note: it requires GL_ARB_texture_barrier extension so be sure to have a recent driver Note2: it requires a lots of testing too Open question: in case of complex date (written alpha) Will it be faster to split the draw call into multiple call with no primitive overlap	2015-04-24 21:12:33 +02:00
Gregory Hainaut	795ae50ecd	gsdx-ogl: fix the recently broken advance date feature Now it is really working with a 2 stages shaders but it is still slow.	2015-04-24 20:13:38 +02:00
Gregory Hainaut	89d5e5637c	glsl: use an explicit cast insead of notEqual function If the compiler didn't optimize the code, it will be a bit faster	2015-04-24 18:01:25 +02:00
Gregory Hainaut	56836561f4	glsl: replace runtime condition by preprocessor condition It might be an easier work for the compiler I didn't replace all occurences to keep readability	2015-04-24 17:51:29 +02:00
Gregory Hainaut	62489f42f1	gsdx-ogl: add an optimization note for later Only 1 byte of fog is useful	2015-04-20 07:18:09 +02:00
Gregory Hainaut	6d253c0b8f	glsl: fix debugging of tex coordinate in apitrace	2015-04-20 07:18:08 +02:00
Gregory Hainaut	31f8c065db	gsdx-ogl: implement a new hack UserHacks_UnscaleSprite for opengl UserHacks_UnscaleSprite = 1 will unscale flat sprites UserHacks_UnscaleSprite = 2 will unscale all sprites (don't work well so far) The idea of the hack is to redo the interpolation of texture coordinate based on the non-upscaled pixel position. It avoids various glitches but sprites aren't upscaled anymore (so no more anti-aliasing, potentially a coefficient can be added).	2015-04-20 07:18:08 +02:00
Gregory Hainaut	16e6d0d305	glsl: move shader into a separate directory Only keep glsl_source.h for clarity	2015-04-19 18:49:02 +02:00
Gregory Hainaut	55fdf26898	glsl: remove the older file tfx.glsl	2015-04-19 18:49:02 +02:00
Gregory Hainaut	15264c6c63	glsl: split the main shader * separate VS/GS and FS * separate subroutine part of the FS It already complex enough without subroutine stuff. Besides I'm not sure we will keep subroutine on the future.	2015-04-19 18:49:02 +02:00
Gregory Hainaut	418f2e69a8	gsdx-ogl: implement the wildhack on the GPU Likely much faster for opengl and much easier to implement Note: hopefully UserHacks_round_sprite_offset will replace it	2015-04-13 22:14:36 +02:00
Gregory Hainaut	05eb1641f2	gsdx: add glsl debug capabilities Useful to debug bad texture sampling	2015-04-08 10:15:25 +02:00
Gregory Hainaut	e40a26be19	gsdx: add some helper to debug openGL renderer	2015-04-03 20:28:44 +02:00
Gregory Hainaut	29a14f5667	sed -i -e 's/seperately/separately/' Thanks micove again	2014-12-07 12:46:16 +01:00
Gregory Hainaut	16284d0a59	gsdx: fix a couple of typo From @micove	2014-12-07 12:12:53 +01:00
Gregory Hainaut	070dce4c83	Merge pull request #356 from PCSX2/issue-fixes various bug-tracker fixes	2014-12-06 19:48:13 +01:00
gabest11	b9b02cf749	implemented opencl program caching on disk under the system default temp folder, needs some additional work on linux	2014-12-02 00:16:35 +01:00
gabest11	e3ba15de94	opencl device selection in settings dialog	2014-12-02 00:16:34 +01:00
gabest11	6f5cd1cd4d	joined some tfx kernel calls, general speed up in most games	2014-12-02 00:16:34 +01:00
gabest11	3d2b0e3766	minor opencl kernel optimizations	2014-12-02 00:16:34 +01:00
gabest11	72cfc6a6ef	3rdparty/opencl	2014-12-02 00:16:34 +01:00
gabest11	9e20387595	Modified for opencl 1.1. While it runs on nvidia cards now, you can't use its sdk to compile gsdx, cl.hpp is missing there. Intel or amd is ok.	2014-12-02 00:16:34 +01:00
gabest11	263c097d13	solution for 32-bit z values in opencl and other minor optimizations	2014-12-02 00:16:34 +01:00
gabest11	c64f9ad9b1	squishing opencl bugs, there aren't many left hopefully	2014-12-02 00:16:34 +01:00
gabest11	ba1e522bbb	Less opencl bugs, some games are actually playable now, there are still texture errors.	2014-12-02 00:16:34 +01:00
gabest11	881735b562	minor merging errors	2014-12-02 00:16:33 +01:00
gabest11	db7c26cde7	- Experimental OpenCL renderer (missing features: point, line, texture cache, mipmap, aa1, device selection). Needs any OpenCL SDK for the common headers and stub lib to compile, tested with AMD and Intel. Too bad it is not part of the Windows SDK yet. - Renumbered renderer ids, compatible with old numbering, but it does not follow the mod3 logic anymore.	2014-12-02 00:16:33 +01:00
Gregory Hainaut	d21e6ff45f	gsdx-ogl: don't declare a sampler variable Nvidia was complaining on the fxaa shader. In doubt I also replaced it in Asmodean's shader	2014-11-20 22:07:44 +01:00
Gregory Hainaut	8c90e7cafc	gsdx-ogl: support latest fxaa version Only tested on Nvidia, please report any issue with your driver Note: requires GL4 GPU	2014-11-10 10:39:55 +01:00
Gregory Hainaut	84f844767c	gsdx-ogl: micro optimize PSConstantBuffer cache Might help to save a cache line on the CPU :)	2014-11-08 21:39:17 +01:00
Asmodean	df0a9c4683	Some small fixes, that I noticed.	2014-11-06 10:56:33 +00:00
Asmodean	937e475169	[Post-Processing] Update GSdx FX effect suite to latest version.	2014-11-03 07:15:34 +00:00
Gregory Hainaut	ccc1137e12	gsdx-ogl: merge the two vertex buffer format * Only a single VAO => Format is set once => Only a single bind at startup => GSVertexBufferStateOGL is nearly useless => barely faster but better than nothing :)	2014-10-02 20:44:22 +02:00
Gregory Hainaut	d37cc8e1e7	gsdx-ogl: GL_ARB_shading_language_420pack is now mandatory All API/drivers support it so time to remove the fallback.	2014-10-02 20:44:22 +02:00
Gregory Hainaut	594f6c33a2	gsdx-ogl-ES: require GL_EXT_shader_io_blocks + GLES3.1 Allow to use same shader interface for all API Note: on the GL API it will require GL3.3 (see next commit)	2014-09-30 22:18:01 +02:00
Gregory Hainaut	4659184cc1	gsdx-ogl: add support of clip_control (depth only) * replace the [-1;1] depth range of openGL with the DX range [0;1].	2014-09-28 12:00:34 +02:00
Gregory Hainaut	0d45e6d70e	gsdx-ogl: avoid to send constant to the GPU It was a waste of bandwith	2014-04-06 10:44:40 +02:00
Asmodean.pcsx2	0651e45460	GSdx: Just some cleaning. Reformatted fx files that were causing issues on certain text editors. They should now display correctly in those editors. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5897 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-11 04:27:32 +00:00
Asmodean.pcsx2	7d96af950e	GSdx: A few small post-processing changes, to accommodate future updates. Also removed the fallback recovery ps, and replaced the compile fail catch to a simple console print. Which I think is safer, and faster. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5894 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-09 01:01:41 +00:00
gregory.hainaut	384c0c12ea	gsdx ogl: * properly detect gl nv depth extension * Always show the hack on the gui. Add a new hack option for DATE (gl4.2) only * Save the scan mode on linux too (f7) * hopefully fix some crash on some drivers... (ensure aligment 256 bits alignment, and if not use std memcpy) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5888 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-07 19:53:01 +00:00
Asmodean.pcsx2	02afca3bb6	GSdx: Just some small post processing cleanup changes, should lower the FXAA compiling time on pause/resumes. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5872 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-02 17:50:32 +00:00
gregory.hainaut	6f3326a9a4	gsdx ogl: Fog hide this typo :p git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5861 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-02-01 10:14:01 +00:00
gregory.hainaut	84895eadd9	gsdx ogl: correct most of Z-depth issue Best setting if you driver support GL_NV_depth_buffer_float => GL_NV_Depth = 1 & logz = 0 Otherwise => GL_NV_Depth = 0 & logz = 1 Explanation of the bug: Dx z position ranges from 0.0f to 1.0f (FS ranges 0.0f to 1.0f) GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f) Why it sucks: GS small depth value will be "mapped" to -1.0f. In others all small values will be 1.0f! Terrible lost of accuraccy. The GL_NV_depth_buffer_float extension allow to set the near plane as -1.0f. So "GL z Position ranges from -1.0f to 1.0f (FS ranges 0.0f to 1.0f)" will become "GL z Position ranges from -1.0f to 1.0f (FS ranges -1.0f to 1.0f)" and therefore "z posision [0.0f;1.0f] will map to FS [0.0f;1.0f]" as DX Yes we just get back all precision lost previously :) However you need hardware (intel?) and driver support (free driver?/gles?) :( git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5860 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-31 21:30:54 +00:00
gregory.hainaut	c2aa4ff3fd	gsdx ogl: * restore the old fxaa (Asmodeam will be integrated when I got time) * port the recently added new scanline algo git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5818 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-18 14:46:13 +00:00
ramapcsx2.code	f8264bce2c	GSdx: Last patch regarding FXAA and external shader loading, I hope :) Changes are here: http://forums.pcsx2.net/Thread-Custom-Shaders-for-GSdx?pid=343285#pid343285 I've included bin\shader.fx and the settings file bin\GSdx_FX_Settings.ini. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5817 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-17 10:17:24 +00:00
ramapcsx2.code	4015db8403	GSdx: Newest scanlines shader looks much nicer this way :) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5814 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-13 18:23:53 +00:00
ramapcsx2.code	bdf5e650a6	GSdx: Quick workaround for DX9 until the shader is adapted. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5813 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-13 17:48:43 +00:00
ramapcsx2.code	1155dd6e21	GSdx: Add another scanline algorithm, made by pseudonym. This one is pretty fancy, using a cosinus function for generating the dark lines. It only works on unscaled output, so use software rendering or native resolution for hardware. It's an effect that works best on 240p converted games (like the SNES titles in Mega Man Collection). git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5812 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 19:47:49 +00:00
gregory.hainaut	23ae6f0641	the regular copy/past mistake git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5811 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 12:35:15 +00:00
gregory.hainaut	2238095a82	gsdx ogl: do the same as previous commit but for ogl ;) * fix the missing auto interlace opt when cycling with hotkey on linux git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5810 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-12 11:38:50 +00:00
ramapcsx2.code	2ac3c47c42	GSdx: Finish up my scanlines attempt. Now the 2 old shaders are back in and all 3 cycle on F7. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5809 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-11 23:17:07 +00:00
gregory.hainaut	f5257b2c3c	gsdx ogl: don't rely on GL_ES builtin (3 differents implementations...). Hopefully fix Linux Intel GPU. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5808 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-11 15:01:13 +00:00
sudonim1@gmail.com	a1f50d4306	GSDX: (FXAA) Treat shader model 4.1 as 4.0 and future shader models as 5 rather than failing to compile. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5803 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-09 18:32:26 +00:00
ramapcsx2.code	a1e387e23e	GSdx: Fix for the DX9 crashing issue, many thanks to Asmodean again! :) (Also sets new defaults with only minor processing, mainly FXAA now) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5798 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-06 18:03:47 +00:00
ramapcsx2.code	20ffe92ee8	GSdx: - FXAA shader replaced by Asmodeans shader suite 'PCSX2 Fx 2.00 Revised'. This is an entire enhancement suite with many configurable options like FXAA, texture sharpening or lighting tweaks. Some of the effects are pretty advanced so kudos to him allowing it to be integrated with GSdx! :) Note that there's no interface for the tweaks yet. Until those are done, the defaults will be used. Release thread here: http://forums.pcsx2.net/Thread-Custom-Shaders-for-GSdx?pid=334766#pid334766 - Disabled an option from the widescreen patches for SH:Shattered Memories. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5793 96395faa-99c1-11dd-bbfe-3dabce05a288	2014-01-05 11:24:43 +00:00
ramapcsx2.code	5a53b682f0	GSdx: Added a simple scanlines filter (2nd one activated by F7). No tweaking done yet. It shows how to get scanlines that aren't affected by input resolution or scaling. A good next step would be determining the exact number of dark lines to show :) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5770 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-12-22 15:36:47 +00:00
gregory.hainaut	e80b002929	gsdx ogl: Flush various pending work * try to use more subroutine on VS&PS, unfortunately hit a driver crash! * Call Attach/DetachContext through GSDevice so I can unmap currently mapped buffer * Implement glsl part of GL_ARB_bindless texture, again hit another driver crash! * various fix of GL_ARB_buffer_storage. Basic benchmark show only improvement on 'cold' case, I guess it will improve smoothness * try to fix GL_clear_texture, no success so far. It seem the extension is limited to basic texture (aka no depth/stencil) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5752 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-10-24 20:54:27 +00:00
gregory.hainaut	e01c6cd9ce	gsdx ogl: the proof of concept commit * GL_ARB_shader_subroutine for perf fix for nvidia => add missing shader declaration. Nvidia got +4fps on colin3 :) For the moment only 2 PS parameters are supported. Code need to be extended to support others games that often switch shader program (like xenosaga). require GL4 class hardware and the option override_GL_ARB_shader_subroutine = 1 Note: strangely on AMD linux it is slower! * GL_ARB_shader_image_load_store for accuraccy (Date) Use a signed integer texture and reenable color buffer writing Current status: Amagami_transparency.gs & P3_battle_shadows.gs are now working on Nvidia with a small perf impact. Current implementation detail: 1/ setup the standard stencil as before 2/ on remaining pixel, draw once to compute first primitive that will write a fail alpha value. 3/ final draw based on primitive id of step 2 Note: I think we would get a bad behavior if depth test&mask are enabled on step 2/3 Note2: on my limited testcase the perf impact was on CPU. It would be possible to merge step1&2 to nullifying it (could even be faster actually), however it would require more GPU power. Again require GL4 class hardware. And the option UserHacks_DateGL4 = 1 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5725 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-28 08:44:16 +00:00
gregory.hainaut	07605941ef	gsdx ogl: * some preliminary work to test/benchmark bindless texture in the future (glsl was not yet updated) Bindless texture allow to get a GPU texture pointer and then set it directly to the shader as a basic uniform. => no more texture unit selection/validation => no more texture validation neither texture hash lookup 3rdparty: update gl header to the latest gl4.4 git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5720 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-17 08:57:52 +00:00
gregory.hainaut	b4084047be	gsdx ogl: Used a basic flat interpolation for color interpolation (line & tri primitives) Card that support gs: remain only a gs to generate sprite from a line. Even dummy gs are costly for the GPU. Card that don't support gs: remove useless copy of color for line and triangle primitives Note for dx: opengl 3.2 (maybe not gles) supports both flat interpolation convention (GL_FIRST_VERTEX_CONVENTION or GL_LAST_VERTEX_CONVENTION). It might be possible to shuffle vertex index to put the last vertex in first position. - buff[0] = head + 0; - buff[1] = head + 1; - buff[2] = head + 2; + buff[0] = head + 2; + buff[1] = head + 1; + buff[2] = head + 0; git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5718 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-14 10:18:38 +00:00
gregory.hainaut	0f603a98d5	gsdx ogl: Test the ARB_shader_subroutine GL4.0 extension The idea was to replace shader program swith by pointer function calls inside shaders. At least parameters that are often changed between draw call. So far I only ported atst and colclip. Unfortunately code is "slower" (on GSdx standalone). For the moment keep the code but disabled. If I understand well the validation of program is done in the "driver thread" but the additional call are done in the overloaded MTGS thread. Apitrace profiling shows faster GPU draw calls. Another possibility is that the driver still need to validate the draw call because of others state change. Here some stats on colin3 (90 frames): without subroutine: UseProgram 125246 with subroutine: UseProgram 2906, subroutine 125945 => 3605 extra calls overhead (not all parameters are ported to subroutine) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5715 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-10 19:43:59 +00:00
gregory.hainaut	64f783410e	gsdx ogl: * preliminary work for GL4.4 extensions (ARB_clear_texture & ARB_multi_bind). Disabled until I got a 4.4 driver Note: I plan also to use ARB_buffer_storage * compute texture gl option in the constructor (avoid a couple of swith case) * redo texture unit management. Unit 0-2 for shaders, Unit 3 for texture operations. MultiBind will allow to bind shader input without disturbing texture binding points. git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5711 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-08-02 16:38:12 +00:00
gregory.hainaut	e394de86dc	gsdx ogl: * add a non-working hack: UserHacks_DateGL4, goal was to replace UserHacks_AlphaStencil + Detection of good/bad samples is based on primitive ID variable. However I'm not sure the behavior is always the same between draw call...Anyway let's keep a copy of the current work * Dump integer texture into text csv * add gl4.2 ARB_shader_image_load_store extension (needed by UserHacks_DateGL4) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5707 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-28 14:40:43 +00:00
gregory.hainaut	b912778a3c	gsdx: forget to regenerate shader in previous commit git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5702 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-13 16:31:19 +00:00
gregory.hainaut	916a091f8a	gsdx ogl: GLes * use gles header file, disable opengl code (mainly GLX, ARB_sso, geometry shader) * Define properly the function pointer, GLES use basic linking whereas GL must get the symbol dynamically * cmake: properly search and set libglesv2.so * don't use dual source blending => HW renderer work (only miss unimportant FBA) git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5701 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-13 11:39:45 +00:00
gregory.hainaut	bd5f379044	gsdx ogl: * uniform was wrongly set before the activation of the program (free driver only) * Always use only 1 drawbuffer. Easier besides previous setup was wrong for convert:ps_main1 GLES trial (v3): * add the cmake option GLES_API. Note library (libgles) are hardcoded for the moment * Disable opengl check * Disable gl_GetDebugMessageLogARB not supported! * Emulate gl_DrawElementsBaseVertex, add manually the index offset (surely slow but work) * Fix hundred of shader error (no implicit cast of integer to float...) Unfortunately GLES doesn't support dual blending so no blend in hardware renderer. Otherwise it is fine git-svn-id: http://pcsx2.googlecode.com/svn/trunk@5700 96395faa-99c1-11dd-bbfe-3dabce05a288	2013-07-12 21:12:34 +00:00

1 2 3 4 5 ...

278 Commits