wget2 2.2.1
Loading...
Searching...
No Matches
XML parsing functions

Functions

int wget_xml_parse_buffer (const char *buf, wget_xml_callback *callback, void *user_ctx, int hints)
void wget_html_parse_buffer (const char *buf, wget_xml_callback *callback, void *user_ctx, int hints)
void wget_xml_parse_file (const char *fname, wget_xml_callback *callback, void *user_ctx, int hints)
void wget_html_parse_file (const char *fname, wget_xml_callback *callback, void *user_ctx, int hints)
char * wget_xml_decode_entities_inline (char *src)

Detailed Description

Function Documentation

◆ wget_xml_parse_buffer()

int wget_xml_parse_buffer ( const char * buf,
wget_xml_callback * callback,
void * user_ctx,
int hints )
Parameters
[in]bufZero-terminated XML or HTML input data
[in]callbackFunction called for each token scan result
[in]user_ctxUser-defined context variable, handed to callback
[in]hintsFlags to influence parsing

This function scans the XML input from buf and calls callback for each token found. user_ctx is a user-defined context variable and given to each call of callback.

hints may be 0 or any combination of XML_HINT_REMOVE_EMPTY_CONTENT and XML_HINT_HTML.

XML_HINT_REMOVE_EMPTY_CONTENT reduces the number of calls to callback by ignoring empty content and superfluous spaces.

XML_HINT_HTML turns on HTML scanning.

◆ wget_html_parse_buffer()

void wget_html_parse_buffer ( const char * buf,
wget_xml_callback * callback,
void * user_ctx,
int hints )
Parameters
[in]bufZero-terminated HTML input data
[in]callbackFunction called for each token scan result
[in]user_ctxUser-defined context variable, handed to callback
[in]hintsFlags to influence parsing

Convenience function that calls wget_xml_parse_buffer() with HTML parsing turned on.

◆ wget_xml_parse_file()

void wget_xml_parse_file ( const char * fname,
wget_xml_callback * callback,
void * user_ctx,
int hints )
Parameters
[in]fnameName of XML or HTML input file
[in]callbackFunction called for each token scan result
[in]user_ctxUser-defined context variable, handed to callback
[in]hintsFlags to influence parsing

Convenience function that calls wget_xml_parse_buffer() with the file content.

If fname is -, the data is read from stdin.

◆ wget_html_parse_file()

void wget_html_parse_file ( const char * fname,
wget_xml_callback * callback,
void * user_ctx,
int hints )
Parameters
[in]fnameName of XML or HTML input file
[in]callbackFunction called for each token scan result
[in]user_ctxUser-defined context variable, handed to callback
[in]hintsFlags to influence parsing

Convenience function that calls wget_xml_parse_file() with HTML parsing turned on.

If fname is -, the data is read from stdin.

◆ wget_xml_decode_entities_inline()

char * wget_xml_decode_entities_inline ( char * src)
Parameters
[in]srcA string
Returns
A pointer to src, after the XML entities have been converted

Decode XML entities from src.

The transformation is done inline, so src will be modified after this function returns. If no XML entities have been found, src is left untouched.

Only a small subset of available XML entities is currently recognized.