Rename it to get out of OpenGL's namespace. The gl_ prefix is used by
other mpv functions, but no OpenGL ones.
The "slice" parameter was never actually used, and all callers passed 0
for it.
The main change is actually that we first copy to a "staging" memory
frame, and then upload this at once. The old non-PBO code called
glTexSubImage2D for each OSD sub-bitmap.
The new non-PBO code path is a bit faster now if there are many small
sub-bitmaps (on Linux/nVidia). It's also a bit simpler, so this is a
win.
(Although I don't particularly appreciate the mixed normal/PBO texture
code.)
Some of these checks became pointless after dropping ES 2.0 support for
extended filtering.
GL_EXT_texture_rg is part of core in ES 3.0, and we already check for
this version, so testing for the extension is redundant.
GL_OES_texture_half_float_linear is also always available, at least as
far as our needs go.
The functionality we need from GL_EXT_color_buffer_half_float is always
available in ES 3.2, and we explicitly check for ES 3.2, so reject this
extension if the ES version is new enough.
For some reason, GLES has no glMapBuffer, only glMapBufferRange.
GLES 2 has no buffer mapping at all, and GL 2.1 does not always have
glMapBufferRange. On those, PBOs remain unsupported (there's no reason to
care about GL 2.1 without the extension).
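To illustrate, a minimal sketch of the mapping path (names are
illustrative, and mpv goes through its GL function pointer table rather
than calling GL directly):

    // Upload via a PBO using only glMapBufferRange, which is available
    // in GL 3.0+ and GLES 3.0 (unlike glMapBuffer).
    static void upload_via_pbo(GLuint pbo, const void *data, size_t size,
                               int x, int y, int w, int h,
                               GLenum format, GLenum type)
    {
        glBindBuffer(GL_PIXEL_UNPACK_BUFFER, pbo);
        glBufferData(GL_PIXEL_UNPACK_BUFFER, size, NULL, GL_STREAM_DRAW);
        void *ptr = glMapBufferRange(GL_PIXEL_UNPACK_BUFFER, 0, size,
                                     GL_MAP_WRITE_BIT |
                                     GL_MAP_INVALIDATE_BUFFER_BIT);
        if (ptr) {
            memcpy(ptr, data, size);        // fill the staging copy
            glUnmapBuffer(GL_PIXEL_UNPACK_BUFFER);
            // data pointer NULL: source the upload from the bound PBO
            glTexSubImage2D(GL_TEXTURE_2D, 0, x, y, w, h, format, type,
                            NULL);
        }
        glBindBuffer(GL_PIXEL_UNPACK_BUFFER, 0);
    }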
This doesn't actually work on ANGLE, and I have no idea why. (There are
artifacts on OSD, as if parts of the OSD data weren't copied.) It works
on desktop OpenGL and at least 1 other ES 3 implementation. Don't enable
it on ANGLE, I guess.
Not sure how much can be gained with this, as we can't use it properly
yet. For now, this is used only before rendering, which probably does
overwhelmingly nothing.
In the future, this should be used after temporary passes, which could
possibly reduce memory usage and even memory bandwidth usage, depending
on the drivers.
Following commit 84ccebd9, the internal helpers don't allow GL_RGB and
GL_RGBA as internal formats for FBO attachments anymore.
While OpenGL itself is perfectly fine with it, I don't see much of a
reason to bother, and mixing sized and unsized internal formats is
confusing anyway.
Just remove these formats.
ES 2.0 has this weird rule that the internalformat parameter alone does
not determine the internal format; the combination of all texture
parameters does. GL_OES_texture_half_float thus does not specify e.g. a
GL_RGBA16F format, but requires passing GL_RGBA as format and
GL_HALF_FLOAT_OES as type. We won't bother with this, since ES 2.0 is a
lost cause anyway.
This also removes the OpenGL error when the code is trying to create a
f16 FBO for testing whether FBOs work.
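For contrast, a hedged sketch of the two creation paths (not the exact
code):

    // Desktop GL / ES 3.0: the sized internal format requests f16
    // storage directly.
    glTexImage2D(GL_TEXTURE_2D, 0, GL_RGBA16F, w, h, 0,
                 GL_RGBA, GL_FLOAT, NULL);

    // ES 2.0 + GL_OES_texture_half_float: internalformat must match
    // format, and the f16 storage is implied by the type alone.
    glTexImage2D(GL_TEXTURE_2D, 0, GL_RGBA, w, h, 0,
                 GL_RGBA, GL_HALF_FLOAT_OES, NULL);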
gl_video_upload_image() can fail in the hardware decoding case. In this
case rendering continued "normally", which meant that pass_get_img_tex()
would kill the process with an assertion failure.
Fix this by allowing gl_video_upload_image() to fail, and exit rendering
early enough to skip code which requires an image to be present. (Maybe
this is still a bit too subtle, but better than before.)
Set an error flag, and render the blue screen we introduced for shader
errors. (For this purpose also move the rendering of it to final output,
to ensure it's visible at all.) The error flag is temporary, because the
associated failure might also be temporary, unlike shader compilation
errors.
ANGLE doesn't handle this very strictly. But if they change this in the
future, it shouldn't brick us.
Not quite happy with the glsl_extensions field, but it is quite
unintrusive after all.
No reason not to, and makes the following commit slightly simpler.
In fact, this makes the shaders more correct too. Normally, "#extension"
must come before any normal shader text, including the "precision"
directive. Not sure why this worked before. (Probably didn't.)
ANGLE was missing texture() overloads in the shader compiler for
GL_TEXTURE_EXTERNAL_OES textures. Support has been added upstream,
so we can use it now.
With the new hooks mechanism, user shaders and such are actually loaded
before rendering starts, instead of being loaded during rendering. This
is used to cache them (instead of e.g. reparsing them every frame).
The cached state wasn't cleared correctly in some situations. Namely,
resizing didn't correctly enable/disable prescale hooks.
Reorganize how these reinitializations are handled. Get rid of
reinit_rendering(), whose meaning was pretty unclear. Call the required
functions to reset or recreate state directly wherever they are needed.
wscript builds hwdec_dxva2gldx.c if gl-dxinterop is enabled, while
video/dxva2.c depends on d3d-hwaccel. If d3d-hwaccel is disabled, then
hwdec_dxva2gldx.c will fail to link, because it uses
d3d9_surface_in_mp_image(), defined in dxva2.c.
Fix this by removing the use of this function. It has barely any value
at this point anyway. Just use the libavcodec documented way to get the
surface directly.
Fixes #3150.
Use dynamic memory allocation, as the static allocation is starting to
get annoying.
Currently, SC_MAX_ENTRIES is essentially still a static upper limit on
the number of shaders. But in future we could try a more clever cache
replacement strategy, which does not keep stale entries forever if the
maximum happens not to be reached.
The new uniforms introduced by 362015c have exceeded the uniform limit
when using high-radius tscale. In addition, the SC limit of 32 entries
might be pushing it with user shaders.
Just make these values a bit bigger to delay the onset of this same
mode. Maybe in the future it should be reworked to grow dynamically?
Either way, we *can* always predict a static upper bound on the number
of uniforms and shader cache entries, it's just that we forgot to do so.
Fixes #3151
This makes it so that users with actual HDR displays can just set their
config to target-trc=st2084 and get native HDR output. This will look a
bit silly for SDR content (everything will be really bright), but for
lack of a better tone mapping situation (including reverse tone mapping)
this is the easiest thing to do for now.
Ideally the brightness metadata should be part of the colorspace struct
or something (with mpv always adapting where necessary), but it depends
on the TRC and not the primaries so it's a bit more complicated than
that.
Since dumb mode is affected by tone mapping (which I'll call a feature,
not a bug), we need to copy over the configuration - in particular, the
defaults. (To prevent a render failure)
Since HDR content is now auto-detected as such, we should probably do
something smarter in the "no configuration" case, such as outputting
gamma 2.2 instead.
This decision will affect the majority of users of stock configurations
who just play back appropriately tagged HDR files, so having a good
default behavior is important. "Output the HDR content as-is" is
definitely not likely to give the user a good result.
Make it dynamic and never remove entries from it.
For now, this is better than possibly creating dangling pointers all
over the place in the gl_user_shader struct.
Untested.
This is now a configurable option, with tunable parameters.
I got inspiration for these algorithms off wikipedia. "simple" seems to
work pretty well, but not well enough to make it a reasonable default.
Some other notable candidates:
- Local functions (e.g. based on local contrast or gradient)
- Clamp with soft knee (linear up to a point)
- Mapping in CIE L*Ch. Map L smoothly, clamp C and h.
- Color appearance models
These will have to be implemented some other time.
Note that the parameter "peak_src" to pass_tone_map should, in
principle, be auto-detected from the SEI information of the source file
where available. This will also have to be implemented in a later
commit.
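For a taste of what "simple" means here, a sketch of a rational
soft-clamp of this family (illustrative only; the actual curve and its
tunable parameters live in the renderer):

    // Map [0, peak] smoothly into [0, 1]: nearly linear in the darks,
    // compressing the highlights instead of hard-clipping them.
    static float tone_map_simple(float x, float peak)
    {
        const float offset = 0.2f;   // hypothetical tunable parameter
        return x / (x + offset) * (peak + offset) / peak;
    }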
Due to the way color management in mpv worked historically, the subtitle
blending function was written to preserve the linearity of the input.
(In the past, the 3DLUT function required linear inputs)
Since the 3DLUT was refactored to accept the video color directly, the
re-linearization after blending is now virtually always redundant.
(Notably, it's also redundant when CMS is turned off, so this way of
writing the code stopped making sense a long time ago. It is a remnant
from before the pass_colormanage function was as flexible as it is now)
Currently, this relies on the user manually entering their display
brightness (since we have no way to detect this at runtime or from ICC
metadata). The default value of 250 was picked by looking at ~10 reviews
on tftcentral.co.uk and realizing they all come with around 250 cd/m^2
out of the box. (In addition, ITU-R Rec. BT.2022 supports this)
Since there is no metadata in FFmpeg to indicate usage of this TRC, the
only way to actually play HDR content currently is to set
``--vf=format=gamma=st2084``. (It could be guessed based on SEI, but
this is not implemented yet)
Incidentally, since SEI is ignored, it's currently assumed that all
content is scaled to 10,000 cd/m^2 (and hard-clipped where out of
range). I don't see this assumption changing much, though.
As an unfortunate consequence of the fact that we don't know the display
brightness, mixed with the fact that LittleCMS' parametric tone curves
are not flexible enough to support PQ, we have to build the 3DLUT
against gamma 2.2 if it's used. This might be a good thing, though,
considering the PQ source space is probably not fantastic for
interpolation either way.
Partially addresses #2572.
This is much more readable than hard-coding magic IDs all over the file,
and removes the need for all the explanatory comments that were a direct
result of this.
This macro takes care of rotation, swizzling, integer conversion and
normalization automatically. I found the performance impact to be
nonexistent for superxbr and debanding, although rotation *did* have an
impact due to the extra matrix multiplication. (So it gets skipped where
possible)
All of the internal hooks have been rewritten to use this new mechanism,
and the prescaler hooks have finally been separated from each other.
This also means the prescale FBO kludge is no longer required.
This fixes image corruption for image formats like 0bgr, and also fixes
prescaling under rotation. (As well as other user hooks that have
orientation-dependent access)
The "raw" attributes (tex, tex_pos, pixel_size) are still un-rotated, in
case something needs them, but ideally the hooks should be rewritten to
use the new API as much as possible. The hooked texture has been renamed
from just NAME to NAME_raw to make script authors notice the change (and
also deemphasize direct texture access).
This is also a step towards getting rid of the use_integer pass.
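Conceptually, the macro amounts to something like this hedged sketch
(name, tex_mul and swizzle stand for per-hook state; the real emission
differs):

    // Emit a per-hook sampling macro that bakes in normalization
    // (tex_mul) and the component swizzle, so user shaders never have
    // to deal with either directly.
    snprintf(glsl, glsl_size,
             "#define %s_tex(pos) (%f * vec4(texture(%s_raw, pos)).%s)\n",
             name, tex_mul, name, swizzle);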
This replaces the previous TRANSFORM by WIDTH, HEIGHT and OFFSET where
WIDTH and HEIGHT are RPN expressions. This allows for more fine-grained
control over the output size, and also makes sure that overwriting
existing textures works more cleanly.
(Also add some more useful bstr functions)
This allows users to add their own near-arbitrary hooks to the vo_opengl
processing pipeline, greatly enhancing the flexibility of user shaders.
This enables, among other things, user shaders such as CrossBilateral,
SuperRes, LumaSharpen and many more.
To make parsing the user shaders easier, shaders are now loaded as
bstrs, and the hooks are set up during video reconfig instead of on
every single frame.
These are "sequence points" where the image could be rendered out to an
FBO, hooked, and re-loaded if any such hook exists. This is perfect for
things like the current user shaders system, as well as optional effects
like unsharp masking.
Note that since we have to pick *some* FBO to store the optionally
hooked texture, we just store it in an array indexed by an increasing
counter. Since we only ever store at most MAX_TEXTURE_HOOKS plus the
number of internal hook points, this is guaranteed to be enough space.
This commit also removes some of the now unused FBOs.
The hook mechanism allows arbitrary processing stages to get dispatched
whenever certain named textures have been "finalized" by the code.
This is mostly meant to serve as a change that opens up the internal
processing in pass_read_video to user scripts, but as a side benefit all
of the code dealing with offsets and plane alignment and other such
confusing things has been rewritten.
This hook mechanism is powerful enough to cover the needs of both
debanding and prescaling (and more), so as a result they can be removed
from pass_read_video entirely and implemented through hooks.
Some avenues for optimization:
- The prescale hook is currently somewhat distributed code-wise. It might be
cleaner to split it into superxbr and NNEDI3 hooks which can each be
self-contained.
- It might be possible to move a large part of the hook code out to an
external file (including the hook definitions for debanding and
prescaling), which would be very much desired.
- Currently, some stages (chroma merging, integer conversion) will
*always* run even if unnecessary. I'm planning another series of
refactors (deferred img_tex) to allow dropping unnecessary shader
stages like these, but that's probably some ways away. In the meantime
it would be doable to re-add some of the logic to skip these stages if
we know we don't need them.
- More hook locations could be added (?)
Instead of rounding down, we round to the nearest float. This reduces
the maximum possible error introduced by this rounding operation. Also
clarify the comment.
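The change itself is a one-liner (sketch; v is the float value, scale
the integer range):

    // Round to nearest instead of truncating, which halves the maximum
    // quantization error from 1/scale to 0.5/scale.
    uint16_t q = (uint16_t)(v * scale + 0.5f);  // was: (uint16_t)(v * scale)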
Remove non-texture_rg compatibility from LUT sampling. OpenGL without
texture_rg support will always trigger dumb-mode, and dumb-mode does not
use LUTs. It used not to, and that was when this made sense.
Fixes broken colors with --vf=format=0bgr (but only if deband is
disabled).
0bgr means the first byte is padding, while the following three bytes
are bgr. From the vo_opengl perspective, it has 4 physical components
with 3 logical components. copy_img_tex() simply copied 3 components
from the physical representation, which means the last component (r) was
sliced off.
Fix this by not using p->color_swizzle for packed formats, and instead
let packed formats set the per-plane swizzle in texplane.swizzle. The
latter applies the swizzle as part of the copy_img_tex() operation, which
essentially moves physical to logical representations.
Unfortunately, debanding (and thus with opengl-hq defaults) is still
broken.
Make the find_plane_format function take a bit count.
This also makes the function's comment true for the first time in the
existence of the function and its comment. (It was commented as taking
bits, but always took bytes.)
Only a few very low bit depth internal formats can be rendered to in
pure ES2 (GL_RGB565 is the "best" one).
Seems like the only potentially reasonable renderable formats in ES2
could be provided via GL_OES_rgb8_rgba8, or half-floats, so don't
bother with this at all.
Now that we know in advance whether an implementation should support a
specific format, we have more flexibility when determining which format
to use.
In particular, we can drop the roundabout ES logic.
I'm not sure if actually trying to create the FBO for probing still has
any value. But it might, so leave it for now.
Even if everything else is available, the need for first class arrays
breaks it. In theory we could fix this since we don't strictly need
them, but I guess it's not worth bothering.
Also give the misnamed have_mix variable a slightly better name.
This merges all knowledge about texture format into a central table.
Most of the work done here is actually identifying which formats exactly
are supported by OpenGL(ES) under which circumstances, and keeping this
information in the format table in a somewhat declarative way. (Although
only to the extent needed by mpv.) In particular, ES and float formats
are a horrible mess.
Again this is a big refactor that might cause regression on "obscure"
configurations.
This uses the normal autoprobing rules like "auto", but rejects anything
that isn't flagged as copying data back to system memory.
The chunk in command.c was dead code, so remove it instead of updating
it.
MSDN documents this as "Introduced in Windows 8.1.". I assume on Windows
7 this field will simply be ignored. Too bad for Windows 7 users.
Also, I'm not using D3D11_VIDEO_PROCESSOR_NOMINAL_RANGE_16_235 and
D3D11_VIDEO_PROCESSOR_NOMINAL_RANGE_0_255, because these are apparently
completely missing from the MinGW headers. (Such a damn pain.)
We don't have any reason to disable either. Both are loaded dynamically
at runtime anyway. There is also no reason why dxva2 would disappear
from libavcodec any time soon.
ANGLE is _really_ annoying to build. (Requires special toolchain and a
recent MSVC version.) This results in various issues with people
having trouble building mpv against ANGLE (apparently linking it
against a prebuilt binary doesn't count, or using binaries from
potentially untrusted sources is not wanted).
Dynamically loading ANGLE is going to be a huge convenience. This commit
implements this, with special focus on keeping it source compatible to
a normal build with ANGLE linked at build-time.
In theory this was needed for the previous commit (but wasn't in
practice, since for hwdec the LUMINANCE_ALPHA mangling is not applied
anymore, and ANGLE uses RG textures in absence of GL_ARB_texture_rg for
whatever crazy reasons).
In practice this caused funky colors on OSX with the uyvy422 format,
which is also fixed in this commit.
This uses EGL_ANGLE_stream_producer_d3d_texture_nv12 and related
extensions to map the D3D textures coming from the hardware decoder
directly in GL.
In theory this would be trivial to achieve, but unfortunately ANGLE does
not have a mechanism to "import" D3D textures as GL textures. Instead,
an awkward mechanism via EGL_KHR_stream was implemented, which involves
at least 5 extensions and a lot of glue code. (Even worse than VAAPI EGL
interop, and very far from the simplicity you get on OSX.)
The ANGLE mechanism so far supports only the NV12 texture format, which
means 10 bit won't work. It also does not work in ES3 mode yet. For
these reasons, the "old" ID3D11VideoProcessor code is kept and used as a
fallback.
It forces es2 mode on ANGLE. Only useful for testing. Since the normal
"angle" backend already falls back to es2 if es3 does not work, this new
backend always exit when autoprobing it.
Rename gl_hwdec_driver.map_image to map_frame, and let it fill out a
struct gl_hwdec_frame describing the exact texture layout. This gives
more flexibility to what the hwdec interop can export. In particular, it
can export strange component orders/permutations and textures with
padded size. (The latter originating from cropped video.)
The way gl_hwdec_frame works is in the spirit of the rest of the
vo_opengl video processing code, which tends to put as much information
in immediate state (as part of the dataflow), instead of declaring it
globally. To some degree this duplicates the texplane and img_tex
structs, but until we somehow unify those, it's better to give the hwdec
state its own struct. The fact that changing the hwdec struct would
require changes and testing on at least 4 platform/GPU combinations
makes duplicating it almost a requirement to avoid pain later.
Make gl_hwdec_driver.reinit set the new image format and remove the
gl_hwdec.converted_imgfmt field.
Likewise, gl_hwdec.gl_texture_target is replaced with
gl_hwdec_plane.gl_target.
Split out a init_image_desc function from init_format. The latter is not
called in the hwdec case at all anymore. Setting up most of struct
texplane is also completely separate in the hwdec and normal cases.
video.c does not check whether the hwdec "mapped" image format is
supported. This should not really happen anyway, and if it does, the
hwdec interop backend must fail at creation time, so this is not an
issue.
The main change is with video/hwdec.h. mp_hwdec_info is made opaque (and
renamed to mp_hwdec_devices). Its accessors are mainly thread-safe (or
documented where not), which makes the whole thing saner and cleaner. In
particular, thread-safety rules become less subtle and more obvious.
The new internal API makes it easier to support multiple OpenGL interop
backends. (Although this is not done yet, and it's not clear whether it
ever will.)
This also removes all the API-specific fields from mp_hwdec_ctx and
replaces them with a "ctx" field. For d3d in particular, we drop the
mp_d3d_ctx struct completely, and pass the interfaces directly.
Remove the emulation checks from vaapi.c and vdpau.c; they are
pointless, and the checks that matter are done on the VO layer.
The d3d hardware decoders might slightly change behavior: dxva2-copy
will not use the VO device anymore if the VO supports proper interop.
This pretty much assumes that in such cases the VO will not use any
form of exclusive mode, which makes using the VO device in copy mode
unnecessary.
This is a big refactor. Some things may be untested and could be broken.
In order to honor the differences between OpenGL and Direct3D coordinate
systems, ANGLE uses a full FBO copy merely to flip the final frame
vertically. This can be avoided with the EGL_ANGLE_surface_orientation
extension.
I hope that this does what we expect it does: destroy the EGLDisplay
specific to our HDC. (Some implementations will terminate all EGL
contexts in the whole process.)
eglReleaseThread() merely calls eglMakeCurrent(0, 0, 0, 0), which is
not enough.
This commit also fixes the problem fixed with the previous commit,
but I think both changes are needed to make our API usage clean.
If ANGLE was probed before (but rejected), the ANGLE API can remain
"initialized", and eglGetCurrentDisplay() will return a non-NULL
EGLDisplay. Then if a native GL context is used, the ANGLE/EGL API will
then (apparently) keep working alongside native OpenGL API. Since GL
objects are just numbers, they'll simply fail to interact, and OpenGL
will get invalid textures. For some reason this will result in black
textures.
With VAAPI-EGL, something similar could happen in theory, but didn't in
practice.
Introduce hwdec-current and hwdec-interop properties.
Deprecate hwdec-detected, which never made a lot of sense, and which is
replaced by the new properties. hwdec-active also becomes useless, as
hwdec-current is a superset, so it's deprecated too (for now).
Cache misses are a normal and expected part of the operation of a cache.
It doesn't really make sense to show a user-visible warning for them.
To work around this, just skip trying to open the cache if it doesn't
exist yet.
First of all, black point compensation is now on by default. This is
really rather harmless and only improves the result (where "improvement"
means "less black clipping").
Second, this adds an option to limit the ICC profile's contrast, which
helps for untagged matrix profiles that are implicitly black scaled even
in colorimetric intent. (Note that this relies on BPC being enabled to
work properly, which is why the two changes are tied together)
Third, this uses the LittleCMS built in black point estimator instead of
relying on the presence of accurate A2B tables. This also checks tags
and does some amounts of noise elimination.
If the option is unspecified and the profile is missing black point
information, print a warning instructing the user to set the option, and
fall back to 1000 otherwise.
The vdpau_mixer could fail to be recreated properly if preemption
occurred at some point before playback initialization (like when using
--hwdec-preload and the opengl-cb API).
Normally, the vdpau_mixer was supposed to be marked invalid when the
components using it detect a preemption, e.g. in hwdec_vdpau.c. This one
didn't mark the vdpau_mixer as invalid if preemption was detected in
reinit(), only in map_image().
It's cleaner to detect preemption directly in the vdpau_mixer, which
ensures it's always recreated correctly.
Including initguid.h at the top of a file that uses references to GUIDs
causes the GUIDs to be declared globally with __declspec(selectany). The
'selectany' attribute tells the linker to consolidate multiple
definitions of each GUID, which would be great except that, in Cygwin
and MinGW GCC 6.1, this method of linking makes the GUIDs conflict with
the ones declared in libuuid.a.
Since initguid.h obsoletes libuuid.a in modern compilers that support
__declspec(selectany), add initguid.h to all files that use GUIDs and
remove libuuid.a from the build.
Fixes #3097
This gives us 16 bit fixed-point integer texture formats, including
ability to sample from them with linear filtering, and using them as FBO
attachments.
The integer texture format path is still there for the sake of ANGLE,
which does not support GL_EXT_texture_norm16 yet.
The change to pass_dither() is needed, because the code path using
GL_R16 for the dither texture relies on glTexImage2D being able to
convert from GL_FLOAT to GL_R16. GLES does not allow this. This could be
trivially fixed by doing the conversion ourselves, but I'm too lazy to
do this now.
Basically this gets rid of the need for the accessors in d3d11va.h, and
the code can be cleaned up a little bit.
Note that libavcodec only defines a ID3D11VideoDecoderOutputView pointer
in the last plane pointers, but it tolerates/passes through the other
plane pointers we set.
This uses ID3D11VideoProcessor to convert the video to a RGBA surface,
which is then bound to ANGLE. Currently ANGLE does not provide any way
to bind nv12 surfaces directly, so this will have to do.
ID3D11VideoContext1 would give us slightly more control about the
colorspace conversion, though it's still not good, and not available
in MinGW headers yet.
The video processor is created lazily, because we need to have the coded
frame size, of which AVFrame and mp_image have no concept. Doing the
creation lazily is less of a pain than somehow hacking the coded frame
size into mp_image.
I'm not really sure how ID3D11VideoProcessorInputView is supposed to
work. We recreate it on every frame, which is simple and hopefully
doesn't affect performance.
The active texture and some pixelstore parameters are now always reset
to defaults when entering and leaving the renderer. Could be important
for libmpv.
Since what we're doing is a linear blend of the four colors, we can just
do it for free by using GPU sampling.
This requires significantly fewer texture fetches and calculations to
compute the final color, making it much more efficient. The code is also
much shorter and simpler.
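The trick, as a sketch (w and h are the texture dimensions in texels):

    // With GL_LINEAR filtering, a single fetch placed exactly on the
    // corner shared by a 2x2 block of texels returns the unweighted
    // average of all four texels - the same linear blend, for free.
    float u = (x + 1.0f) / w;   // corner between texels x and x+1
    float v = (y + 1.0f) / h;   // corner between texels y and y+1
    // texture(tex, vec2(u, v)) averages the 2x2 block in one fetch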
Until now, we have made the assumption that a driver will use only 1
hardware surface format. The format is dictated by the driver (you
don't create surfaces with a specific format - you just pass a
rt_format and get a surface that will be in a specific driver-chosen
format).
In particular, the renderer created a dummy surface to probe the format,
and hoped the decoder would produce the same format. Due to a driver
bug this required a workaround to actually get the same format as the
driver did.
Change this so that the format is determined in the decoder. The format
is then passed down as hw_subfmt, which allows the renderer to configure
itself with the correct format. If the hardware surface changes its
format midstream, the renderer can be reconfigured using the normal
mechanisms.
This calls va_surface_init_subformat() each time after the decoder
returns a surface. Since libavcodec/AVFrame has no concept of sub-
formats, this is unavoidable. It creates and destroys a derived
VAImage, but this shouldn't have any bad performance effects (at
least I didn't notice any measurable effects).
Note that vaDeriveImage() failures are silently ignored as some
drivers (the vdpau wrapper) support neither vaDeriveImage, nor EGL
interop. In addition, we still probe whether we can map an image
in the EGL interop code. This is important as it's the only way
to determine whether EGL interop is supported at all. With respect
to the driver bug mentioned above, it doesn't matter which format
the test surface has.
In vf_vavpp, also remove the rt_format guessing business. I think the
existing logic was a bit meaningless anyway. It's not even a given
that vavpp produces the same rt_format for output.
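As a sketch, the probing boils down to this (hedged; error handling and
the fourcc-to-imgfmt mapping are omitted):

    #include <stdint.h>
    #include <va/va.h>

    // Derive a VAImage from the decoded surface to learn the
    // driver-chosen format, then destroy it again right away.
    static uint32_t probe_surface_fourcc(VADisplay dpy, VASurfaceID surface)
    {
        VAImage image;
        if (vaDeriveImage(dpy, surface, &image) != VA_STATUS_SUCCESS)
            return 0;   // silently ignored (vdpau wrapper etc.)
        uint32_t fourcc = image.format.fourcc;   // e.g. VA_FOURCC_NV12
        vaDestroyImage(dpy, image.image_id);
        return fourcc;
    }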
Apply basic transformations like rotation by 90° and mirroring when
sampling from the source textures. The original idea was making this
part of img_tex.transform, but this didn't work: lots of code plays
tricks on the transform, so manipulating it is not necessarily
transparent, especially when width/height are switched. So add a new
pre_transform field, which is strictly applied before the normal
transform.
This fixes most glitches involved with rotating the image.
Cropping and rotation are now weirdly separated, even though they could
be done in the same step. I think this is not much of a problem, and
has the advantage that changing panscan does not trigger FBO
reallocations (I think...).
Typically happens with some implementations if no context is current,
or is otherwise broken. This is particularly relevant to the opengl_cb
API, because the API user will have no other indication what went wrong.
The underlying intention of this code is to make changing
--videotoolbox-format at runtime work. For this reason, the format can't
just be statically setup, but must be read from the option at runtime.
This means the format is not fixed anymore, and we have to make sure the
renderer is properly reinitialized if the format changes. There is
currently no way to trigger reinit on this level, which is why the
mp_image_params.hw_subfmt field was introduced.
One sketchy thing remains: normally, the renderer is supposed to be
involved with VO format negotiation, which would ensure that the VO
can take the format at all. Since the hw_subfmt is not part of this
format negotiation, it's implied the get_vt_fmt() callback only
returns formats supported by the renderer. This is not necessarily
clear because vo_opengl checks this with converted_imgfmt separately.
None of this matters in practice though, because we know all formats
are always supported.
(This still requires somehow triggering decoder reinit to make the
change effective.)
This makes the black point closer (chromatically) to the white point, by
ensuring channels keep their consistent brightness ratios as they go
down to zero.
I also raised the 3DLUT version as this changes semantics and is a
separate commit from the previous one.
This commit refactors the 3DLUT loading mechanism to build the 3DLUT
against the original source characteristics of the file. This allows us,
among other things, to use a real BT.1886 profile for the source. This
also allows us to actually use perceptual mappings. Finally, this
reduces errors on standard gamut displays (where the previous 3DLUT
target of BT.2020 was unreasonably wide).
This also improves the overall accuracy of the 3DLUT due to eliminating
rounding errors where possible, and allows for more accurate use of
LUT-based ICC profiles.
The current code is somewhat more ugly than necessary, because the idea
was to implement this commit in a working state first, and then maybe
refactor the profile loading mechanism in a later commit.
Fixes #2815.
This also draws it after color management etc. In a nutshell, this
change makes the transparency checkerboard independent of upscaling,
panning, cropping etc. It will always be the same apparent size and
position (relative to the window).
It will also be independent of the video colorspace and such things.
(Note: This might cause white imbalance issues if playing a file with a
white point that does not match the display, in absolute colorimetric
mode. But that's uncommon, especially in conjunction with transparent
image files, so it's not a primary concern here)
Until now, we've let the windowing backend decide. But since they
usually require premultiplied alpha, and premultiplied alpha is easier
to handle, hardcode it.
The recent changes fixed rotation handling, but reversed the rotation
direction. The direction is expected to be counter-clockwise, because
demuxers export video rotation metadata as such.
This has been completely broken since commit 93546f0c. But even before,
rotation handling did not make too much sense. In particular, it rotated
the contents of the cropped image, instead of adjusting the crop
rectangle as well. The result was that things like panscan or zooming
did not behave as expected with rotation applied.
The same is true for vertical flipping. Flipping is triggered by
negative image stride. OpenGL does not support flipping the image on
upload, so it's done as part of the rendering. It can be triggered with
--vf=flip, but other filters and even decoders could setup negative
stride to flip the image.
Fix these issues by applying transforms to texture coordinates properly,
and by making rotation and flipping part of these transforms.
This still doesn't work properly for separated scaling. The issue is
that we'd have to adjust how the passes are done. For now, pick a very
stupid solution by rotating the image to a FBO, and then scaling from
that. This has the advantage that the scale logic doesn't have to be
complicated for such a rare case. It could be improved later.
Prescaling is apparently still broken. I don't know if chroma
positioning works properly either. None of this should affect the case
with no rotation.
gl_transform_vec() assumed column-major, while everything else seemed to
assumed row-major memory organization for gl_transform.m. Also,
gl_transform_trans() seems to contain additional confusion.
This didn't matter until now, as everything has been orthogonal, thus
the swapped matrix entries were always 0.
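With the convention settled as row-major, the helper looks roughly like
this (sketch of the fixed version; struct gl_transform holds a 2x2
matrix m plus a translation t):

    static void gl_transform_vec(struct gl_transform t, float *x, float *y)
    {
        // Row-major: row i of m produces output component i.
        float nx = t.m[0][0] * *x + t.m[0][1] * *y;
        float ny = t.m[1][0] * *x + t.m[1][1] * *y;
        *x = nx;
        *y = ny;
    }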
If the texture count is lower than 4, entries in va.textcoord[] will
remain uninitialized. While this is unlikely to be a problem (since
these values are unused on the shader side too), it's not nice and might
explain some things which have shown up in valgrind.
Fix by always initializing the whole thing.
Instead of reallocating almost all of the shader string several times
per pass, build it into a fixed buffer that will be reallocated as
needed.
While this still uses a linear search and full comparison of the shader
text, this will compare the shader's string length first before doing a
full comparison as a nice side effect. (That's also why the fragment
shader is compared first - it's more likely to be different for
different cache entries than the vertex shader stub.)
Glitches when resizing are still possible, but are reduced. Other VOs
could support this too, but don't need to do so.
(Totally avoiding glitches would be much more effort, and probably not
worth the trouble. How about you just watch the video the player is
playing, instead of spending your time resizing the window.)
Until now, we have tried to create a GL 3.0 context. The main reason for
this is that many Mesa-based drivers did not support anything better.
But some drivers (Mesa AMD) will not report a higher OpenGL version,
because their compatibility mode is restricted. While later GL features
are reported as extensions just fine, there doesn't seem to be a way to
determine or enable higher GLSL versions.
Add some more shitty hacks to try to deal with this messed up situation,
and try to probe each interesting GL version separately (starting with
3.3, then 3.2 etc.). Other backends might suffer from similar problems,
but these will have to deal with it on their own.
Probably fixes #2938, or maybe not.
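A sketch of the probing loop (create_context stands in for the
backend-specific context creation):

    // Try each interesting GL version separately, best first; take the
    // first one the driver lets us create.
    static const int gl_versions[] = {330, 320, 310, 300, 210, 0};
    for (int n = 0; gl_versions[n]; n++) {
        int ver = gl_versions[n];
        if (create_context(ctx, ver / 100, (ver / 10) % 10))
            break;
    }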
converted_imgfmt will be used by the renderer logic to build an
appropriate shader chain. It doesn't influence the format of any
textures. Thus it doesn't matter whether the hw video surface is mapped
as RGB or RGBA. What matters is if the video actually contains alpha or
not. Since virtually all hardware decoders do not support alpha in any
way, this can be hardcoded as "no alpha".
This avoids unnecessary GPU work.
This also gets rid of the kind of hard to read texture swizzle setup and
turns it into something dumber.
Assumes that we don't create any FBOs with 2 channel formats. (Only the
video source textures are handled by this commit.)
Previously, gl->DXOpenDeviceNV was called twice when using dxva2 with
dxinterop. AMD drivers refused to allow this. With this commit,
context_dxinterop sets its own implementation of MPGetNativeDisplay,
which can return either a IDirect3DDevice9Ex or a
dxinterop_device_HANDLE, depending on the "name" request string.
hwdec_dxva2gldx then requests both of these, avoiding the need to call
gl->DXOpenDeviceNV a second time.
Like dxinterop, this uses StretchRect for RGB conversion. This is
unavoidable as long as we use the dxva2 API, as there is no way to
access the raw hardware decoded Direct3D9 surfaces.
The default of 1.0 was basically making half the algorithm do nothing,
since it turned off all diagonal contributions. The upstream default is
0.6, and this produces a more reasonable image.
The values were changed to reflect an upstream change in the source for
the super-xBR implementation.
The anti-ringing code was basically not working at all, the new
algorithm _significantly_ improves the result (reduces ringing).
This is a fresh implementation from scratch that carries with it
significantly less baggage and verbosity from the previous (ported)
version.
The actual values for the masks and such were copied from the
current code. Behavior and performance should be unaffected.
An important difference between the old code and the new code is that
the new code always explicitly samples from the first component, rather
than being able to process multiple planes at once.
Since prescale-luma only affects luma, I deemed this unnecessary. May
change in the future, if prescale-chroma ever gets implemented. But
prescaling multiple planes would be slow to do this way. (Better would
be to generalize it to differently-sized vectors)
Instead of hard-coding the logic and planes to skip, factor this out
to a reusable function, and add the number of relevant coordinates to
the texture state.
Since prescale now literally only affects the luma plane (and the
filters are all designed for luma-only operation either way), the option
has been renamed and the documentation updated to clarify this.
This is a pretty major rewrite of the internal texture binding
mechanic, which makes it more flexible.
In general, the difference between the old and current approaches is
that now, all texture description is held in a struct img_tex and only
explicitly bound with pass_bind. (Once bound, a texture unit is assumed
to be set in stone and no longer tied to the img_tex)
This approach makes the code inside pass_read_video significantly more
flexible and cuts down on the number of weird special cases and
spaghetti logic.
It also has some improvements, e.g. cutting down greatly on the number
of unnecessary conversion passes inside pass_read_video (which was
previously mostly done to cope with the fact that the alternative would
have resulted in a combinatorial explosion of code complexity).
Some other notable changes (and potential improvements):
- texture expansion is now *always* handled in pass_read_video, and the
colormatrix never does this anymore. (Which means the code could
probably be removed from the colormatrix generation logic, modulo some
other VOs)
- struct fbo_tex now stores both its "physical" and "logical"
(configured) size, which cuts down on the amount of width/height
baggage on some function calls
- vo_opengl can now technically support textures with different bit
depths (e.g. 10 bit luma, 8 bit chroma) - but the APIs it queries
inside img_format.c don't export this (nor does ffmpeg support it,
really) so the status quo of using the same tex_mul for all planes is
kept.
- dumb_mode is now only needed because of the indirect_fbo being in the
main rendering pipeline. If we reintroduce p->use_indirect and thread
a transform through the entire program this could be skipped where
unnecessary, allowing for the removal of dumb_mode. But I'm not sure
how to do this in a clean way. (Which is part of why it got introduced
to begin with)
- It would be trivial to resurrect source-shader now (it would just be
one extra 'if' inside pass_read_video).
Why was this done so stupidly, with so many complicated special cases,
before? Declare it once so the shader bits don't have to figure out where
and when to do so themselves.
The WGL_NV_DX_interop spec says that a shared IDirect3DSurface9 must not
be lockable, but off-screen plain surfaces are always lockable and using
them causes Nvidia drivers to crash. Use a rendertarget for the shared
surface instead.
This also changes the name of the DX_interop handle for the rendertarget
to match the name of the DirectX object (rather than the GL one) to
match the convention used in context_dxinterop.c.
Apple crap (namely hardware decoding interop) forces us to use rectangle
textures for input. But after that we continue with normal textures.
This was not considered for debanding, and the sampler type used for it
can be different depending on the exact render chain. Simply use the
target type of the input texture.
* use mp_HRESULT_to_str/mp_LastError_to_str
* make some messages non-identical
* replace "GL" -> "OpenGL"
* change some MP_FATAL to MP_ERR that don't actually kill the vo
Coverity thinks that integer_conv_fbo[index] is implied to be accessed
with up to index=5. Although that is theoretical only, it has a point
that this makes no sense. Use the same constant for the array
allocation, to make it more uniform and robust.
Fixes CID 1350060.
Since there can be multiple backends for a single API (vaapi can use GLX
or EGL), not logging the exact backend name is annoying. So add it. At
the same time, there is no need to duplicate the name as used by the
--hwdec options, so replace it with using the numeric hwdec API ID.
GLES requires this. Some more common sampler types have default
precisions, but not usampler2D. Newer ANGLE builds verify this more
strictly than older builds, so this wasn't caught before.
Fixes #2761.
GLES does not support high bit depth fixed point textures for unknown
reasons, so direct 10 bit input is not possible. But we can still use
integer textures, which are supported by GLES 3.0. These store integer
data just like the standard fixed point textures, except they are not
normalized on sampling. They also don't support bilinear filtering, and
require a special sampler ("usampler2D").
While these texture formats enable us to shuffle the data to the GPU,
they're rather impractical with the requirements mentioned above and our
current architecture. One problem is that most code assumes it can
always use bilinear scaling (even if bilinear is never used when using
appropriate scale/cscale options). Another is that we don't have any
concept of running a function on a texture in a uniform way.
So for now, run a simple conversion step through a FBO. The FBO will use
the rgba16f format normally, which gives enough bits for 10 bit, and
will at least gracefully degrade with higher depth input.
This is bound to be much slower than a more "direct" method, but at
least it works and is simple to implement.
The odd change of function call order in init_video() is to properly
disable "dumb mode" (no FBO use) if these texture formats are in use.
This was never reset - absolutely can't be right. If the renderer
somehow switches back to another codepath, it certainly has to be reset.
Maybe this was hard to hit, as the normalization is going to be
idempotent in simpler cases (like rendering RGBA input).
Also get rid of the "merged" variable.
Often requested. The main argument, that prominent scalers like sharpen
change the image even if no scaling happens, disappeared anyway.
("sharpen", unsharp masking, is neither prominent nor a scaler anymore.
This is an artifact from MPlayer, which fuses unsharp masking with
bilinear scaling in order to make it single-pass, or so.)
Some VOs had support for these - remove them.
Typically, these formats will have only some use in cases where using
RGB software conversion with libswscale is faster than letting the
VO/GPU do it (i.e. almost never). For the sake of testing this case,
keep IMGFMT_RGB565. This is the least messy format, because it has no
padding/alpha bits with unknown semantics.
Note that decoding to these formats still works. We'll let libswscale
repack the data to whatever the VO in use can take.
Do this to make the license situation less confusing.
This change should be of no consequence, since LGPL is compatible with
GPL anyway, and making it LGPL-only does not restrict the use with GPL
code.
Additionally, the wording implies that this is allowed, and that we can
just remove the GPL part.
This covers source files which were added in mplayer2 and mpv times
only, and where all code is covered by LGPL relicensing agreements.
There are probably more files to which this applies, but I'm being
conservative here.
A file named ao_sdl.c exists in MPlayer too, but the mpv one is a
complete rewrite, and was added some time after the original ao_sdl.c
was removed. The same applies to vo_sdl.c, for which the SDL2 API is
radically different in addition (MPlayer supports SDL 1.2 only).
common.c contains only code written by me. But common.h is a strange
case: although it originally was named mp_common.h and exists in MPlayer
too, by now it contains only definitions written by uau and me. The
exceptions are the CONTROL_ defines - thus not changing the license of
common.h yet.
codec_tags.c contained once large tables generated from MPlayer's
codecs.conf, but all of these tables were removed.
From demux_playlist.c I'm removing a code fragment from someone who was
not asked; this probably could be done later (see commit 15dccc37).
misc.c is a bit complicated to reason about (it was split off mplayer.c
and thus contains random functions out of this file), but actually all
functions have been added post-MPlayer. Except get_relative_time(),
which was written by uau, but looks similar to 3 different versions of
something similar in each of the Unix/win32/OSX timer source files. I'm
not sure what that means in regards to copyright, so I've just moved it
into another still-GPL source file for now.
screenshot.c once had some minor parts of MPlayer's vf_screenshot.c, but
they're all gone.
Should take care of the planned FFmpeg AV_PIX_FMT_P010 addition. (This
will eventually be needed when doing HEVC Main 10 decoding with DXVA2
copyback.)
This file claims to be based on the "MPlayer VA-API patch", but this is
untrue. Only some glue code was copied from hwdec_vaglx.c, and this glue
code was never in MPlayer or the MPlayer VA-API patch in any form, and
instead part of the mpv-original way we do hardware decoding OpenGL
interop. The EGL interop method didn't exist at the time the MPlayer
VA-API patch was created either.
GLSL in GLES 2.0 did not have line continuation in its preprocessor.
This broke shader compilation. It also broke subtitle rendering in
vo_rpi, which reuses some of the OpenGL code.
Line continuation was finally added in GLES 3.0, which is perhaps the
reason why ANGLE accepted it.
Untested, but should be fine. Broken by commit 0a0bb905.
Also fix the include statement in context_rpi.c, which caused another
compilation failure. Also untested. (Because I'm lazy.)
Fixes #2638.
gcc 4.8 does not support C11 thread local storage. This is a bit
annoying, so add a hack to use the gcc specific __thread extension if
C11 TLS is not available.
(This is used for the extremely silly mpv-internal way hwdec modules
access some platform specific handles. Disabling it simply made
hwdec_vaegl.c always fail initialization.)
Fixes #2631.
Add a "blend-tiles" choice to the "alpha" sub-option. This is pretty
simplistic and uses the GL raster position to derive the tiles. A weird
consequence is that using --vo=opengl and --vo=opengl-hq gives different
scaling behavior (screenspace pixel size vs. source video pixel size
16x16 tiles), but it seems we don't have easy access to the original
texture coordinates. Using the rasterpos is probably simpler.
Make this option the default.
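The tile pattern itself is cheap; a sketch of the GLSL this amounts to
(constants are illustrative):

    // A screenspace 16x16 checkerboard derived from the raster
    // position, blended under the video according to its alpha.
    static const char *blend_tiles_glsl =
        "float tile = mod(floor(gl_FragCoord.x / 16.0) +\n"
        "                 floor(gl_FragCoord.y / 16.0), 2.0);\n"
        "vec3 background = vec3(0.5 + 0.25 * tile);\n"
        "color.rgb = mix(background, color.rgb, color.a);\n";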
long is 64 bits on x86_64 on Linux, which means the check for the corner
case of computing the depth mask is wrong.
Also, X11 compositors seem to expect premultiplied alpha.
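The general shape of the pitfall (a sketch, not the exact code):
shifting a long by the depth behaves differently once long is 64 bits,
so compute the mask in a fixed-width type instead:

    // Safe for any depth up to 64, regardless of sizeof(long):
    uint64_t mask = depth >= 64 ? UINT64_MAX : (UINT64_C(1) << depth) - 1;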
Since alpha isn't pulled through the colormatrix (maybe it should?), we
reject alpha formats with odd sizes, such as yuva444p10.
But the awful tex_mul path in vo_opengl does this anyway (at some points
even explicitly), which means there will be a subtle difference in
handling of 16 bit yuv alpha formats. Make it consistent and always
apply the range adjustment to the alpha component. This also means odd
sizes like 10 bit are supported now.
This assumes alpha uses the same "shifted" range as the yuv color
channels for depths larger than 8 bit. I'm not sure whether this is
actually the case.
Now common.c only contains the code for the function loader, while
context.c contains the backend loader/dispatcher.
Not calling it "backend.c", because the central struct is called
MPGLContext.
This is used for dithering, although I'm not aware of anyone who got
higher than 8 bit depth support to work on Linux.
Also put this into egl_helpers.c. Since EGL is pseudo-portable at best I
have no hope that the EGL context creation code in all the backends can
be fully shared. But some self-contained functionality can definitely be
shared.
Store the determined framebuffer depth in struct GL instead of
MPGLContext. This means gl_video_set_output_depth() can be removed, and
also justifies adding new fields describing framebuffer/backend
properties to struct GL instead of having to add more functions just to
shovel the information around.
Keep in mind that mpgl_load_functions() will wipe struct GL, so the
new fields must be set before calling it.
Although the source file is named w32.c, the backend name was "win"
until recently. It was accidentally changed to "w32"; fix it.
Fixes #2608 (the manual is correct).
When a Direct3D 9Ex device fails to reset, it gets put into the lost
state, so set the lost_device flag and don't attempt to present until
the device moves out of that state. Failure to recreate the size-
dependent objects should set lost_device as well, since we shouldn't try
to present in that state.
Also, it looks like I was too eager to remove code that sets priv
members to NULL and I accidentally removed some that was needed.
Direct3D doesn't like 0-sized swapchain dimensions, even when those
dimensions are automatically set. Manually set them to a size that isn't
zero instead.
Why not.
Also, instead of disabling hue/saturation for RGB, just don't apply
them. (They don't make sense for conversion matrixes other than YUV, but
I can't be bothered to keep the fine-grained disabling of UI controls
either.)
WGL_NV_DX_interop is widely supported by Nvidia and AMD drivers. It
allows a texture to be shared between Direct3D and WGL, so that
rendering can be done with WGL and presentation can be done with
Direct3D. This should allow us to work around some persistent WGL
issues, such as dropped frames with some driver/OS combos, drivers that
buffer frames to increase performance at the cost of latency, and the
inability to disable exclusive fullscreen mode when using WGL to render
to a fullscreen window.
The addition of a DX_interop backend might also enable some cool
Direct3D-specific enhancements in the future, such as using the
GetPresentStatistics API to get accurate frame presentation timestamps.
Note that due to a driver bug, this backend is currently broken on
Intel. It will appear to work as long as the window is not resized too
often, but after a few changes of size it will be unable to share the
newly created renderbuffer with GL. See:
https://software.intel.com/en-us/forums/graphics-driver-bug-reporting/topic/562051
With default setting, the matrix for fruit dithering requires 12 bits
precision (values from 0/4096 to 4095/4096). But 16-bit float
provides only 10 bits. In addition, when `dither-size-fruit=8` is
set, 16 bits are required from the texture format.
Fix this by attempting to use 16 bit integer texture first. This is
still not precise, but should be better than using a half float.
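A sketch of the preferred path (a valid ES 3.0 combination; matrix_u16
and size are illustrative):

    // Unnormalized 16 bit integer texture for the dither matrix; the
    // shader divides by the matrix size itself, so no precision is
    // lost to normalization.
    glTexImage2D(GL_TEXTURE_2D, 0, GL_R16UI, size, size, 0,
                 GL_RED_INTEGER, GL_UNSIGNED_SHORT, matrix_u16);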
The recent LUT adjustment changes broke interpolation.
The concatenation of the shader stages is a bit messy, and it seems like
sampler_prelude is not a good place to add this macro. Always add the
macro to every shader instead. (While this doesn't seem too elegant,
this isn't too inelegant either, and gets these problems out of the
way.)
The computation of the tex_mul variable was broken in multiple ways.
This variable is used e.g. by debanding for moving expansion of 10 bit
fixed-point input to normalized range to another stage of processing.
One obvious bug was that the rgb555 pixel format was broken. This format
has component_bits=5, but obviously it's already sampled in normalized
range, and does not need expansion. The tex_mul-free code path avoids
this by not using the colormatrix. (The code was originally designed to
avoid dealing with the generally complicated pixel formats by only
using the colormatrix in the YUV case.)
Another possible bug was with 10 bit input. It expanded the input by
bringing the [0,2^10) range to [0,1], and then treating the expanded
input as 16 bit input. I didn't bother to check what this actually
computed, but it's somewhat likely it was wrong anyway. Now it uses
mp_get_csp_mul(), and disables expansion when computing the YUV matrix.
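For reference, the arithmetic for N bit data stored in a 16 bit
texture: normalized sampling yields x * (2^N - 1) / (2^16 - 1), so the
multiplier restoring the nominal range is the inverse ratio:

    // For 10 bit input in a 16 bit texture:
    double tex_mul = (double)((1 << 16) - 1) / ((1 << 10) - 1);  // ~64.06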
It turns out that with accurate lookup we can decrease the
the default texture size now. Do it to compensate for the performance
loss introduced by the LUT_POS macro.
Define a macro to correct the coordinate for the lookup texture. Cache
the corrected coordinate for the 1D filter and use mix() to minimize
the performance impact.
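The macro remaps a [0,1] coordinate so that 0 and 1 land on the centers
of the first and last texel instead of on the texture edges; roughly
(as emitted into the shader header):

    static const char *lut_pos_define =
        "#define LUT_POS(x, lut_size) "
        "mix(0.5 / (lut_size), 1.0 - 0.5 / (lut_size), (x))\n";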