Loongson Technology Corporation Limited 00b8e3b9cd loongarch: Improve the performance of sad/sad_x3/sad_x4 series functions
Encoding speed improved from 4.92 fps to 6.32 fps (about 28% faster).
Tested with the following commands:
./configure && make -j5
./x264 --threads 4 -o out.mkv yuv_1920x1080.yuv

functions           performance (c)  performance (asm)
sad_4x4                 13               3
sad_4x8                 26               7
sad_4x16                57               13
sad_8x4                 24               3
sad_8x8                 54               8
sad_8x16                108              13
sad_16x8                95               8
sad_16x16               189              13
sad_x3_4x4              37               6
sad_x3_4x8              71               13
sad_x3_8x4              70               8
sad_x3_8x8              162              14
sad_x3_8x16             323              25
sad_x3_16x8             279              15
sad_x3_16x16            555              27
sad_x4_4x4              49               8
sad_x4_4x8              95               17
sad_x4_8x4              94               8
sad_x4_8x8              214              16
sad_x4_8x16             429              33
sad_x4_16x8             372              18
sad_x4_16x16            740              34

Signed-off-by: wanglu <wanglu@loongson.cn>
2023-10-10 09:09:52 +08:00
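
For readers unfamiliar with the functions named in the table, the following is a minimal scalar sketch of what a SAD (sum of absolute differences) routine and its x3 variant compute. It is an illustration only: the names `sad_ref` and `sad_x3_ref`, their parameter lists, and the generic width/height arguments are assumptions made for this example, not the x264 function signatures and not the LSX/LASX assembly added by this commit.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical scalar reference: sum of absolute differences between two
 * width x height blocks of 8-bit pixels, each with its own row stride. */
static int sad_ref(const uint8_t *pix1, intptr_t stride1,
                   const uint8_t *pix2, intptr_t stride2,
                   int width, int height)
{
    assert(width > 0 && height > 0);
    int sum = 0;
    for (int y = 0; y < height; y++) {
        for (int x = 0; x < width; x++)
            sum += abs(pix1[x] - pix2[x]);
        pix1 += stride1;   /* advance both blocks by one row */
        pix2 += stride2;
    }
    return sum;
}

/* Hypothetical x3 variant: score one source block against three candidate
 * blocks that share a stride, writing the three SADs to scores[]. */
static void sad_x3_ref(const uint8_t *src, intptr_t src_stride,
                       const uint8_t *ref0, const uint8_t *ref1,
                       const uint8_t *ref2, intptr_t ref_stride,
                       int width, int height, int scores[3])
{
    scores[0] = sad_ref(src, src_stride, ref0, ref_stride, width, height);
    scores[1] = sad_ref(src, src_stride, ref1, ref_stride, width, height);
    scores[2] = sad_ref(src, src_stride, ref2, ref_stride, width, height);
}
```

An x4 variant would follow the same pattern with a fourth candidate. Batching candidates this way lets a vectorized implementation load the source block once and reuse it for every candidate, which is one plausible reason the x3/x4 rows in the table benefit so much from assembly.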
common loongarch: Improve the performance of sad/sad_x3/sad_x4 series functions 2023-10-10 09:09:52 +08:00
doc Fix typos with codespell tool 2021-09-29 21:11:16 +00:00
encoder Fix compilation of only 8 or 10 bit by a non-optimizing compiler 2023-01-28 21:45:30 +03:00
extras Bump dates to 2023 2023-01-28 16:37:02 +03:00
filters Bump dates to 2023 2023-01-28 16:37:02 +03:00
input Bump dates to 2023 2023-01-28 16:37:02 +03:00
output Bump dates to 2023 2023-01-28 16:37:02 +03:00
tools loongarch: Init LSX/LASX support 2023-10-10 09:00:09 +08:00
.gitignore .gitignore: Ignore TAGS file 2018-01-17 18:31:04 +01:00
.gitlab-ci.yml CI: Fix vlc-contrib linking on macOS 2022-09-01 23:17:40 +03:00
AUTHORS Fix typos with codespell tool 2021-09-29 21:11:16 +00:00
COPYING * all: re-import of the CVS. 2004-06-03 19:29:33 +00:00
Makefile loongarch: Improve the performance of sad/sad_x3/sad_x4 series functions 2023-10-10 09:09:52 +08:00
autocomplete.c Bump dates to 2023 2023-01-28 16:37:02 +03:00
config.guess loongarch: Init LSX/LASX support 2023-10-10 09:00:09 +08:00
config.sub loongarch: Init LSX/LASX support 2023-10-10 09:00:09 +08:00
configure loongarch: Init LSX/LASX support 2023-10-10 09:00:09 +08:00
example.c Bump dates to 2023 2023-01-28 16:37:02 +03:00
version.sh Simplify version.sh 2015-08-19 01:00:22 +03:00
x264.c Improve qpfile parsing resiliency 2023-04-02 15:51:50 +03:00
x264.h loongarch: Init LSX/LASX support 2023-10-10 09:00:09 +08:00
x264cli.h Bump dates to 2023 2023-01-28 16:37:02 +03:00
x264dll.c Bump dates to 2023 2023-01-28 16:37:02 +03:00
x264res.manifest Add support for long filenames on Windows 10 2020-09-27 23:32:10 +02:00
x264res.rc Bump dates to 2023 2023-01-28 16:37:02 +03:00