/*****************************************************************************
 * vlc_charset.h: Unicode UTF-8 wrapper functions
 *****************************************************************************
LGPL
Re-license almost all of libVLC and libVLCcore to LGPLv2.1+
This move was authorized by the developers, either:
- by e-mail,
- by vote at the VideoLAN Dev Days 2011,
- on the license website,
- in a contract, oral or written.
No objection was raised, so far.
The developers agreeing are:
Justus Piater
Alexis Ballier
Alexander Bethke
Mohammed Adnène Trojette
Alex Converse
Alexey Sokolov
Alexis de Lattre
Andre Pang
Anthony Loiseau
Cyril Deguet
André Weber
Boris Dorès
Brieuc Jeunhomme
Benjamin Drung
Hugo Beauzée-Luyssen
Benoit Steiner
Benjamin Pracht
Bernie Purcell
Przemyslaw Fiala
Arnaud de Bossoreille de Ribou
Brad Smith
Nick Briggs
Christopher Rath
Christophe Courtaut
Christopher Mueller
Clement Chesnin
Andres Krapf
Damien Fouilleul
David Flynn
Sebastien Zwickert
Antoine Cellerier
Jérôme Decoodt
Jérome Decoodt
Dylan Yudaken
Eduard Babayan
Eugenio Jarosiewicz
Elliot Murphy
Eric Petit
Erwan Tulou
Etienne Membrives
Ludovic Fauvet
Fabio Ritrovato
Tobias Güntner
Jakub Wieczorek
Frédéric Crozat
Francois Cartegnie
Laurent Aimar
Florian G. Pflug
Felix Paul Kühne
Frank Enderle
Rafaël Carré
Simon Latapie
Gildas Bazin
Geoffroy Couprie
Julien / Gellule
Gildas Bazin
Arnaud Schauly
Toralf Niebuhr
Vicente Jimenez Aguilar
Derk-Jan Hartman
Henri Fallon
Ilkka Ollakka
Olivier Teulière
Rémi Duraffort
Jakob Leben
Jean-Baptiste Kempf
Jean-Paul Saman
Jean-Philippe Grimaldi
Jean-François Massol
Gaël Hendryckx
Jakob Leben
Jean-Marc Dressler
Jai Menon
Johan Bilien
Johann Ransay
Joris van Rooij
JP Dinger
Jean-Philippe André
Adrien Grand
Juha Jeronen
Juho Vähä-Herttua
Kaarlo Raiha
Kaarlo Raiha
Kamil Baldyga
Keary Griffin
Ken Self
KO Myung-Hun
Pierre Ynard
Filippo Carone
Loïc Minier
Luca Barbato
Lucas C. Villa Real
Lukas Durfina
Adrien Maglo
Marc Ariberti
Mark Lee
Mark Moriarty
Martin Storsjö
Christophe Massiot
Michel Kaempf
Marian Ďurkovič
Mirsal Ennaime
Carlo Calabrò
Damien Lucas
Naohiro Koriyama
Basos G
Pierre Baillet
Vincent Penquerc'h
Olivier Aubert
Pankaj Yadav
Paul Corke
Pierre d'Herbemont
Philippe Morin
Antoine Lejeune
Michael Ploujnikov
Jean-Marc Dressler
Michael Hanselmann
Rafaël Carré
Ramiro Polla
Rémi Denis-Courmont
Renaud Dartus
Richard Shepherd
Faustino Osuna
Arnaud Vallat
Rob Jonson
Robert Jedrzejczyk
Steve Lhomme
Rocky Bernstein
Romain Goyet
Rov Juvano
Sam Hocevar
Martin T. H. Sandsmark
Sebastian Birk
Sébastien Escudier
Vincent Seguin
Fabio Ritrovato
Sigmund Augdal Helberg
Casian Andrei
Srikanth Raju
Hannes Domani
Stéphane Borel
Stephan Krempel
Stephan Assmus
Tony Castley
Pavlov Konstantin
Eric Petit
Tanguy Krotoff
Dennis van Amerongen
Michel Lespinasse
Can Wu
Xavier Marchesini
Sébastien Toque
Christophe Mutricy
Yoann Peronneau
Yohann Martineau
Yuval Tze
Scott Caudle
Clément Stenac
It is possible, that some minor piece of code was badly tracked, for
some reasons (SVN, mainly) or that some small developers did not answer.
However, as an "œuvre collective", defined as in "CPI 113-2 alinéa 3",
and seeing "Cour. Cass. 17 Mai 1978", and seeing that the editor and
the very vast majority of developers have agreed (> 99.99% of the code,
> 99% of developers), we are fine here.
 * Copyright (C) 2003-2005 VLC authors and VideoLAN
 * Copyright © 2005-2010 Rémi Denis-Courmont
 *
 * Author: Rémi Denis-Courmont
 *
 * This program is free software; you can redistribute it and/or modify it
 * under the terms of the GNU Lesser General Public License as published by
 * the Free Software Foundation; either version 2.1 of the License, or
 * (at your option) any later version.
 *
 * This program is distributed in the hope that it will be useful,
 * but WITHOUT ANY WARRANTY; without even the implied warranty of
 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
 * GNU Lesser General Public License for more details.
 *
 * You should have received a copy of the GNU Lesser General Public License
 * along with this program; if not, write to the Free Software Foundation,
 * Inc., 51 Franklin Street, Fifth Floor, Boston MA 02110-1301, USA.
 *****************************************************************************/

#ifndef VLC_CHARSET_H
#define VLC_CHARSET_H 1

/**
 * \file vlc_charset.h
 * \ingroup charset
 * \defgroup charset Character sets
 * \ingroup strings
 * @{
 */

/**
 * Decodes a code point from UTF-8.
 *
 * Converts the first character in a UTF-8 sequence into a Unicode code point.
 *
 * \param str a UTF-8 byte sequence [IN]
 * \param pwc address of a location to store the code point [OUT]
 *
 * \return the number of bytes occupied by the decoded code point
 *
 * \retval -1 not a valid UTF-8 sequence
 * \retval 0 null character (i.e. str points to an empty string)
 * \retval 1 (non-null) ASCII character
 * \retval 2-4 non-ASCII character
 */
VLC_API ssize_t vlc_towc(const char *str, uint32_t *restrict pwc);

/**
 * Checks UTF-8 validity.
 *
 * Checks whether a null-terminated string is a valid UTF-8 byte sequence.
 *
 * \param str string to check
 *
 * \retval str the string is a valid null-terminated UTF-8 sequence
 * \retval NULL the string is not a UTF-8 sequence
 */
VLC_USED static inline const char *IsUTF8(const char *str)
{
    ssize_t n;
    uint32_t cp;

    while ((n = vlc_towc(str, &cp)) != 0)
        if (likely(n != -1))
            str += n;
        else
            return NULL;
    return str;
}

/**
 * Checks ASCII validity.
 *
 * Checks whether a null-terminated string is a valid ASCII byte sequence
 * (non-printable ASCII characters 1-31 are permitted).
 *
 * \param str string to check
 *
 * \retval str the string is a valid null-terminated ASCII sequence
 * \retval NULL the string is not an ASCII sequence
 */
VLC_USED static inline const char *IsASCII(const char *str)
{
    unsigned char c;

    for (const char *p = str; (c = *p) != '\0'; p++)
        if (c >= 0x80)
            return NULL;
    return str;
}

/**
 * Removes non-UTF-8 sequences.
 *
 * Replaces invalid or <i>over-long</i> UTF-8 byte sequences within a
 * null-terminated string with question marks. This is so that the string can
 * be printed at least partially.
 *
 * \warning Do not use this where correctness is critical. Use IsUTF8() and
 * handle the error case instead. This function is mainly for display or debug.
 *
 * \note Converting from Latin-1 to UTF-8 in place is not possible (the string
 * size would be increased). So it is not attempted even if it would otherwise
 * be less disruptive.
 *
 * \retval str the string is a valid null-terminated UTF-8 sequence
 *             (i.e. no changes were made)
 * \retval NULL the string was not a valid UTF-8 sequence
 *              (invalid bytes were replaced with question marks)
 */
static inline char *EnsureUTF8(char *str)
{
    char *ret = str;
    ssize_t n;
    uint32_t cp;

    while ((n = vlc_towc(str, &cp)) != 0)
        if (likely(n != -1))
            str += n;
        else
        {
            *str++ = '?';
            ret = NULL;
        }
    return ret;
}

/**
 * \defgroup iconv iconv wrappers
 *
 * (defined in src/extras/libc.c)
 * @{
 */

#define VLC_ICONV_ERR ((size_t) -1)
typedef void *vlc_iconv_t;
VLC_API vlc_iconv_t vlc_iconv_open( const char *, const char * ) VLC_USED;
VLC_API size_t vlc_iconv( vlc_iconv_t, const char **, size_t *, char **, size_t * ) VLC_USED;
VLC_API int vlc_iconv_close( vlc_iconv_t );

/** @} */
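
The wrappers above have the same shape as POSIX iconv: open a descriptor, convert with in/out cursors and remaining-byte counters, then close. The sketch below uses plain POSIX `iconv()` rather than the VLC wrappers, so it says nothing certain about how `vlc_iconv()` behaves internally; `sketch_latin1_via_iconv` is a hypothetical helper, and the example assumes a libc that provides `<iconv.h>` with the `ISO-8859-1` and `UTF-8` converters (some platforms need an extra `-liconv` at link time).

```c
#include <assert.h>
#include <iconv.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: converts a NUL-terminated ISO-8859-1 string to a
 * newly allocated UTF-8 string. Returns NULL on failure; the caller frees. */
static char *sketch_latin1_via_iconv(const char *latin)
{
    assert(latin != NULL);

    iconv_t cd = iconv_open("UTF-8", "ISO-8859-1"); /* (to, from) */
    if (cd == (iconv_t)-1)
        return NULL;

    size_t inleft = strlen(latin);
    size_t outleft = 2 * inleft; /* a Latin-1 byte needs at most 2 bytes */
    char *out = malloc(outleft + 1);
    if (out == NULL) {
        iconv_close(cd);
        return NULL;
    }

    /* POSIX iconv() takes char **; the input buffer is not modified. */
    char *inp = (char *)latin;
    char *outp = out;
    size_t rc = iconv(cd, &inp, &inleft, &outp, &outleft);
    iconv_close(cd);

    if (rc == (size_t)-1) { /* same sentinel value as VLC_ICONV_ERR */
        free(out);
        return NULL;
    }
    *outp = '\0';
    return out;
}
```

Sizing the output buffer up front works here only because the worst-case expansion of Latin-1 is known; for arbitrary charsets a caller would normally grow the buffer when iconv() reports that the output space ran out.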

#include <stdarg.h>

VLC_API int utf8_vfprintf( FILE *stream, const char *fmt, va_list ap );
VLC_API int utf8_fprintf( FILE *, const char *, ... ) VLC_FORMAT( 2, 3 );
VLC_API char * vlc_strcasestr(const char *, const char *) VLC_USED;

VLC_API char * FromCharset( const char *charset, const void *data, size_t data_size ) VLC_USED;
VLC_API void * ToCharset( const char *charset, const char *in, size_t *outsize ) VLC_USED;

#ifdef __APPLE__
# include <CoreFoundation/CoreFoundation.h>

/* Obtains a copy of the contents of a CFString in specified encoding.
 * Returns char* (must be freed by caller) or NULL on failure.
 */
VLC_USED static inline char *FromCFString(const CFStringRef cfString,
    const CFStringEncoding cfStringEncoding)
{
    // Try the quick way to obtain the buffer
    const char *tmpBuffer = CFStringGetCStringPtr(cfString, cfStringEncoding);

    if (tmpBuffer != NULL) {
        return strdup(tmpBuffer);
    }

    // The quick way did not work, try the long way
    CFIndex length = CFStringGetLength(cfString);
    CFIndex maxSize =
        CFStringGetMaximumSizeForEncoding(length, cfStringEncoding);

    // If result would exceed LONG_MAX, kCFNotFound is returned
    if (unlikely(maxSize == kCFNotFound)) {
        return NULL;
    }

    // Account for the null terminator
    maxSize++;

    char *buffer = (char *)malloc(maxSize);

    if (unlikely(buffer == NULL)) {
        return NULL;
    }

    // Copy CFString in requested encoding to buffer
    Boolean success = CFStringGetCString(cfString, buffer, maxSize, cfStringEncoding);

    if (!success)
        FREENULL(buffer);
    return buffer;
}
#endif

#ifdef _WIN32
# include <windows.h>

VLC_USED
static inline char *FromWide (const wchar_t *wide)
{
    size_t len = WideCharToMultiByte (CP_UTF8, 0, wide, -1, NULL, 0, NULL, NULL);
    if (len == 0)
        return NULL;

    char *out = (char *)malloc (len);

    if (likely(out))
        WideCharToMultiByte (CP_UTF8, 0, wide, -1, out, len, NULL, NULL);
    return out;
}

VLC_USED
static inline wchar_t *ToWide (const char *utf8)
{
    int len = MultiByteToWideChar (CP_UTF8, 0, utf8, -1, NULL, 0);
    if (len == 0)
        return NULL;

    wchar_t *out = (wchar_t *)malloc (len * sizeof (wchar_t));

    if (likely(out))
        MultiByteToWideChar (CP_UTF8, 0, utf8, -1, out, len);
    return out;
}

VLC_USED VLC_MALLOC
static inline char *ToCodePage (unsigned cp, const char *utf8)
{
    wchar_t *wide = ToWide (utf8);
    if (wide == NULL)
        return NULL;

    size_t len = WideCharToMultiByte (cp, 0, wide, -1, NULL, 0, NULL, NULL);
    if (len == 0) {
        free(wide);
        return NULL;
    }

    char *out = (char *)malloc (len);
    if (likely(out != NULL))
        WideCharToMultiByte (cp, 0, wide, -1, out, len, NULL, NULL);
    free (wide);
    return out;
}

VLC_USED VLC_MALLOC
static inline char *FromCodePage (unsigned cp, const char *mb)
{
    int len = MultiByteToWideChar (cp, 0, mb, -1, NULL, 0);
    if (len == 0)
        return NULL;

    wchar_t *wide = (wchar_t *)malloc (len * sizeof (wchar_t));
    if (unlikely(wide == NULL))
        return NULL;
    MultiByteToWideChar (cp, 0, mb, -1, wide, len);

    char *utf8 = FromWide (wide);
    free (wide);
    return utf8;
}

VLC_USED VLC_MALLOC
static inline char *FromANSI (const char *ansi)
{
    return FromCodePage (GetACP (), ansi);
}

VLC_USED VLC_MALLOC
static inline char *ToANSI (const char *utf8)
{
    return ToCodePage (GetACP (), utf8);
}

# define FromLocale FromANSI
# define ToLocale ToANSI
# define LocaleFree(s) free((char *)(s))
# define FromLocaleDup FromANSI
# define ToLocaleDup ToANSI

#elif defined(__OS2__)

VLC_USED static inline char *FromLocale (const char *locale)
{
    return locale ? FromCharset ((char *)"", locale, strlen(locale)) : NULL;
}

VLC_USED static inline char *ToLocale (const char *utf8)
{
    size_t outsize;
    return utf8 ? (char *)ToCharset ("", utf8, &outsize) : NULL;
}

VLC_USED static inline void LocaleFree (const char *str)
{
    free ((char *)str);
}

VLC_USED static inline char *FromLocaleDup (const char *locale)
{
    return FromCharset ("", locale, strlen(locale));
}

VLC_USED static inline char *ToLocaleDup (const char *utf8)
{
    size_t outsize;
    return (char *)ToCharset ("", utf8, &outsize);
}

#else

# define FromLocale(l) (l)
# define ToLocale(u) (u)
# define LocaleFree(s) ((void)(s))
# define FromLocaleDup strdup
# define ToLocaleDup strdup
#endif

/**
 * Converts a nul-terminated string from ISO-8859-1 to UTF-8.
 */
static inline char *FromLatin1 (const char *latin)
{
    char *str = (char *)malloc (2 * strlen (latin) + 1), *utf8 = str;
    unsigned char c;

    if (str == NULL)
        return NULL;

    while ((c = *(latin++)) != '\0')
    {
         if (c >= 0x80)
         {
             *(utf8++) = 0xC0 | (c >> 6);
             *(utf8++) = 0x80 | (c & 0x3F);
         }
         else
             *(utf8++) = c;
    }
    *(utf8++) = '\0';

    utf8 = (char *)realloc (str, utf8 - str);
    return utf8 ? utf8 : str;
}
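
The bit arithmetic in FromLatin1() can be checked in isolation on a single byte. The helper below is a hypothetical illustration (`sketch_latin1_byte` is not part of this header): every Latin-1 byte at or above 0x80 maps to the two-byte UTF-8 form `110000xx 10xxxxxx`, so for example 0xE9 ('é') becomes 0xC3 0xA9.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: encodes one ISO-8859-1 byte as UTF-8 into out[]
 * and returns the number of bytes written (1 or 2). */
static size_t sketch_latin1_byte(unsigned char c, unsigned char out[2])
{
    assert(out != NULL);

    if (c < 0x80) {
        out[0] = c; /* ASCII is unchanged */
        return 1;
    }
    out[0] = 0xC0 | (c >> 6);   /* lead byte carries the top 2 bits */
    out[1] = 0x80 | (c & 0x3F); /* continuation carries the low 6 bits */
    return 2;
}
```

Because the output is never more than twice the input length, FromLatin1() can allocate `2 * strlen(latin) + 1` bytes up front and shrink the buffer afterwards.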

/**
 * \defgroup c_locale C/POSIX locale functions
 * @{
 */

/**
 * Parses a double in C locale.
 *
 * This function parses a double-precision floating point number from a string
 * just like the standard strtod() but it uses the C locale. In other words, it
 * expects the POSIX/C/American decimal format regardless of the current
 * numeric locale.
 *
 * \param str nul-terminated string to parse
 * \param[out] end storage space for a pointer to the first unparsed byte
 *                 (or NULL to discard it)
 * \return the parsed double value (zero if no character could be parsed)
 */
VLC_API double vlc_strtod_c(const char *restrict str, char **restrict end)
VLC_USED;
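
One way to obtain the behaviour described for vlc_strtod_c() on a POSIX.1-2008 system is to switch the calling thread to a temporary "C" numeric locale around strtod(). The sketch below is an assumption about a possible approach, not a statement of how libvlccore implements it; `sketch_strtod_c` is a hypothetical name, and some platforms declare `locale_t` in a different header than `<locale.h>`.

```c
#ifndef _POSIX_C_SOURCE
# define _POSIX_C_SOURCE 200809L
#endif
#include <assert.h>
#include <locale.h>
#include <stdlib.h>

/* Hypothetical helper: parses a double using the "C" numeric locale,
 * whatever the current thread or process locale is. */
static double sketch_strtod_c(const char *str, char **end)
{
    assert(str != NULL);

    locale_t loc = newlocale(LC_NUMERIC_MASK, "C", (locale_t)0);
    if (loc == (locale_t)0)
        return strtod(str, end); /* fall back to the current locale */

    locale_t old = uselocale(loc); /* affects the calling thread only */
    double value = strtod(str, end);
    uselocale(old);
    freelocale(loc);
    return value;
}
```

With this, "3.5" parses as 3.5 even when the user's locale uses a comma as decimal separator, which is the situation the C-locale functions in this group are meant to handle.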

/**
 * Parses a float in C locale.
 *
 * This function parses a single-precision floating point number from a string
 * just like the standard strtof() but it uses the C locale. In other words, it
 * expects the POSIX/C/American decimal format regardless of the current
 * numeric locale.
 *
 * \param str nul-terminated string to parse
 * \param[out] end storage space for a pointer to the first unparsed byte
 *                 (or NULL to discard it)
 * \return the parsed float value (zero if no character could be parsed)
 */
VLC_API float vlc_strtof_c(const char *restrict str, char **restrict end)
VLC_USED;

/**
 * Parses a double in C locale.
 *
 * This function parses a double-precision floating point number from a string
 * just like the standard atof() but it uses the C locale. In other words, it
 * expects the POSIX/C/American decimal format regardless of the current
 * numeric locale.
 *
 * \param str nul-terminated string to parse
 * \return the parsed double value (zero if no character could be parsed)
 */
VLC_USED static inline double vlc_atof_c(const char *str)
{
    return vlc_strtod_c(str, NULL);
}

/**
 * Formats a string using the C locale.
 *
 * This function formats a string from a format string and a variable argument
 * list, just like the standard vasprintf() but using the C locale for the
 * formatting of numerals.
 *
 * \param[out] p storage space for a pointer to the heap-allocated formatted
 *               string (undefined on error)
 * \param fmt format string
 * \param ap variable argument list
 * \return number of bytes formatted (excluding the nul terminator)
 *        or -1 on error
 */
VLC_API int vlc_vasprintf_c(char **restrict p, const char *restrict fmt,
                            va_list ap) VLC_USED;

/**
 * Formats a string using the C locale.
 *
 * This function formats a string from a format string and a variable argument
 * list, just like the standard asprintf() but using the C locale for the
 * formatting of numerals.
 *
 * \param[out] p storage space for a pointer to the heap-allocated formatted
 *               string (undefined on error)
 * \param fmt format string
 * \return number of bytes formatted (excluding the nul terminator)
 *        or -1 on error
 */
VLC_API int vlc_asprintf_c( char **p, const char *fmt, ... ) VLC_USED;

/**
 * Writes a string to the output using the C locale.
 *
 * This function formats a string from a format string and a variable argument
 * list, just like the standard vfprintf() but using the C locale for the
 * formatting of numerals.
 *
 * \param f output stream to write the string to
 * \param fmt format string
 * \param ap variable argument list
 * \return number of bytes formatted (excluding the nul terminator)
 *        or -1 on error
 */
VLC_API int vlc_vfprintf_c(FILE *f, const char *fmt, va_list ap);

/**
 * Writes a string to the output using the C locale.
 *
 * This function formats a string from a format string and a variable argument
 * list, just like the standard fprintf() but using the C locale for the
 * formatting of numerals.
 *
 * \param f output stream to write the string to
 * \param fmt format string
 * \return number of bytes formatted (excluding the nul terminator)
 *        or -1 on error
 */
VLC_API int vlc_fprintf_c(FILE *f, const char *fmt, ...);

int vlc_vsscanf_c(const char *, const char *, va_list) VLC_USED;
int vlc_sscanf_c(const char*, const char*, ...) VLC_USED
#ifdef __GNUC__
__attribute__((format(scanf, 2, 3)))
#endif
;

/** @} */
/** @} */

#endif