ScannerConfig

class ScannerConfig(*args, **kwargs)

Specifies the Scanner parser configuration. Most settings can be changed during the parsing phase and will affect the lexical parsing of the next unpeeked token.

Fields

class ScannerConfig
case_sensitive

Specifies if symbols are case sensitive (the default is False).

char_2_token

Specifies if characters are reported by setting token = ch or as CHAR (the default is True).

cpair_comment_single

Specifies the characters at the start and end of single-line comments. The default is “#\n” which means that single-line comments start with a ‘#’ and continue until a ‘\n’ (end of line).
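As an illustration of how a start/end character pair delimits a comment, here is a minimal pure-Python sketch. It is not the scanner's implementation: the function name strip_single_line_comments is hypothetical, and keeping the terminating character in the output is a choice made for this example only.

```python
def strip_single_line_comments(text, cpair="#\n"):
    """Drop everything from cpair[0] up to (not including) cpair[1]."""
    start, end = cpair[0], cpair[1]
    out = []
    in_comment = False
    for ch in text:
        if in_comment:
            if ch == end:
                # The comment ends here; keep the terminator so that
                # line structure survives (a choice made for this sketch).
                in_comment = False
                out.append(ch)
        elif ch == start:
            in_comment = True
        else:
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    print(repr(strip_single_line_comments("a = 1 # set a\nb = 2\n")))
```

Passing a different pair, for example ";\n", would treat ‘;’ as the comment start instead of ‘#’.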

cset_identifier_first

Specifies the characters which can start identifiers (the default is CSET_a_2_z, “_”, and CSET_A_2_Z).

cset_identifier_nth

Specifies the characters which can be used in identifiers, after the first character (the default is CSET_a_2_z, “_0123456789”, CSET_A_2_Z, CSET_LATINS, CSET_LATINC).
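The interplay of the two identifier character sets can be sketched in plain Python. This is an illustrative model only: IDENT_FIRST, IDENT_NTH and read_identifier are hypothetical names, and the stand-in sets cover ASCII letters, the underscore and digits while omitting the Latin character sets.

```python
import string

# Stand-ins for the default sets (ASCII only; the Latin sets are omitted).
IDENT_FIRST = string.ascii_lowercase + "_" + string.ascii_uppercase
IDENT_NTH = string.ascii_lowercase + "_0123456789" + string.ascii_uppercase


def read_identifier(text, pos=0, first=IDENT_FIRST, nth=IDENT_NTH):
    """Return the identifier starting at text[pos], or None if none starts there."""
    if pos >= len(text) or text[pos] not in first:
        return None
    end = pos + 1
    # Consume further characters while they belong to the "nth" set.
    while end < len(text) and text[end] in nth:
        end += 1
    return text[pos:end]


if __name__ == "__main__":
    print(read_identifier("foo_1 = 2"))
    print(read_identifier("1abc"))
```

Extending the second set, for example with ‘-’, lets identifiers such as a-b be read as a single unit while still forbidding a leading ‘-’.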

cset_skip_characters

Specifies which characters should be skipped by the scanner (the default is the whitespace characters: space, tab, carriage-return and line-feed).

identifier_2_string

Specifies if identifiers are reported as strings (the default is False).

int_2_float

Specifies if all numbers are reported as FLOAT (the default is False).

numbers_2_int

Specifies if binary, octal and hexadecimal numbers are reported as INT (the default is True).
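How int_2_float and numbers_2_int could combine is easiest to see as a small mapping. The sketch below is a plain-Python model under stated assumptions: reported_token is a hypothetical helper, the token kinds are represented as strings, and the ordering (numbers_2_int applied before int_2_float) is assumed rather than taken from this page.

```python
def reported_token(kind, numbers_2_int=True, int_2_float=False):
    """Map a scanned numeric kind to the kind that would be reported."""
    # Binary, octal and hexadecimal numbers collapse to INT when requested.
    if kind in ("BINARY", "OCTAL", "HEX") and numbers_2_int:
        kind = "INT"
    # Anything that is INT at this point may then be promoted to FLOAT.
    if kind == "INT" and int_2_float:
        kind = "FLOAT"
    return kind


if __name__ == "__main__":
    for flags in ({}, {"numbers_2_int": False}, {"int_2_float": True}):
        print(flags, reported_token("HEX", **flags))
```

Under this model, a hexadecimal number is only reported as FLOAT when both settings are enabled.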

padding_dummy

scan_binary

Specifies if binary numbers are recognized (the default is False).

scan_comment_multi

Specifies if multi-line comments are recognized (the default is True).

scan_float

Specifies if floating point numbers are recognized (the default is True).

scan_hex

Specifies if hexadecimal numbers are recognized (the default is True).

scan_hex_dollar

Specifies if ‘$’ is recognized as a prefix for hexadecimal numbers (the default is False).
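The effect of the numeric scan_* switches can be modelled with a short plain-Python sketch. The helper parse_number is hypothetical, and the prefixes used here ("0x" for hexadecimal, "0b" for binary) are assumptions based on common convention; only the ‘$’ prefix is stated on this page.

```python
def parse_number(text, scan_binary=False, scan_hex=True, scan_hex_dollar=False):
    """Return the integer value of text, or None if no enabled form matches."""
    lowered = text.lower()
    try:
        if scan_hex and lowered.startswith("0x"):
            return int(lowered[2:], 16)
        if scan_hex_dollar and lowered.startswith("$"):
            return int(lowered[1:], 16)
        if scan_binary and lowered.startswith("0b"):
            return int(lowered[2:], 2)
        # Fall back to plain decimal; prefixed forms that are not
        # enabled fail here and are reported as unrecognised.
        return int(lowered, 10)
    except ValueError:
        return None


if __name__ == "__main__":
    print(parse_number("0xFF"))
    print(parse_number("$FF", scan_hex_dollar=True))
    print(parse_number("0b101", scan_binary=True))
```

With the defaults, "$FF" and "0b101" are not recognized as numbers in this model, mirroring the False defaults of scan_hex_dollar and scan_binary.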

scan_identifier

Specifies if identifiers are recognized (the default is True).

scan_identifier_1char

Specifies if single-character identifiers are recognized (the default is False).

scan_identifier_NULL

Specifies if NULL is reported as IDENTIFIER_NULL (the default is False).

scan_octal

Specifies if octal numbers are recognized (the default is True).

scan_string_dq

Specifies if strings can be enclosed in double quotes (the default is True).

scan_string_sq

Specifies if strings can be enclosed in single quotes (the default is True).

scan_symbols

Specifies if symbols are recognized (the default is True).

scope_0_fallback

Specifies if a symbol is searched for in the default scope in addition to the current scope (the default is False).
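The fallback lookup can be pictured with nested dictionaries. This is a sketch under stated assumptions, not the scanner's data structure: lookup_symbol and the scopes mapping are hypothetical, and scope 0 is taken to be the default scope.

```python
def lookup_symbol(scopes, scope_id, name, scope_0_fallback=False):
    """Look name up in scope_id, optionally retrying in scope 0."""
    value = scopes.get(scope_id, {}).get(name)
    if value is None and scope_0_fallback and scope_id != 0:
        # Not found in the current scope: retry in the default scope.
        value = scopes.get(0, {}).get(name)
    return value


if __name__ == "__main__":
    scopes = {0: {"if": "KEYWORD_IF"}, 1: {"width": "PROP_WIDTH"}}
    print(lookup_symbol(scopes, 1, "if"))
    print(lookup_symbol(scopes, 1, "if", scope_0_fallback=True))
```

With the default of False, a symbol registered only in the default scope is invisible while another scope is current.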

skip_comment_multi

Specifies if multi-line comments are skipped and not returned as tokens (the default is True).

skip_comment_single

Specifies if single-line comments are skipped and not returned as tokens (the default is True).

store_int64

Specifies if integer values are stored in value.v_int64 rather than v_int.

symbol_2_token

Specifies if symbols are reported by setting token = v_symbol or as SYMBOL (the default is False).