UnicodeType

class UnicodeType

These are the possible character classifications from the Unicode specification. See `Unicode Character Database <http://www.unicode.org/reports/tr44/#General_Category_Values>`_.

Fields

CLOSE_PUNCTUATION

General category “Punctuation, Close” (Pe)

CONNECT_PUNCTUATION

General category “Punctuation, Connector” (Pc)

CONTROL

General category “Other, Control” (Cc)

CURRENCY_SYMBOL

General category “Symbol, Currency” (Sc)

DASH_PUNCTUATION

General category “Punctuation, Dash” (Pd)

DECIMAL_NUMBER

General category “Number, Decimal Digit” (Nd)

ENCLOSING_MARK

General category “Mark, Enclosing” (Me)

FINAL_PUNCTUATION

General category “Punctuation, Final quote” (Pf)

FORMAT

General category “Other, Format” (Cf)

INITIAL_PUNCTUATION

General category “Punctuation, Initial quote” (Pi)

LETTER_NUMBER

General category “Number, Letter” (Nl)

LINE_SEPARATOR

General category “Separator, Line” (Zl)

LOWERCASE_LETTER

General category “Letter, Lowercase” (Ll)

MATH_SYMBOL

General category “Symbol, Math” (Sm)

MODIFIER_LETTER

General category “Letter, Modifier” (Lm)

MODIFIER_SYMBOL

General category “Symbol, Modifier” (Sk)

NON_SPACING_MARK

General category “Mark, Nonspacing” (Mn)

OPEN_PUNCTUATION

General category “Punctuation, Open” (Ps)

OTHER_LETTER

General category “Letter, Other” (Lo)

OTHER_NUMBER

General category “Number, Other” (No)

OTHER_PUNCTUATION

General category “Punctuation, Other” (Po)

OTHER_SYMBOL

General category “Symbol, Other” (So)

PARAGRAPH_SEPARATOR

General category “Separator, Paragraph” (Zp)

PRIVATE_USE

General category “Other, Private Use” (Co)

SPACE_SEPARATOR

General category “Separator, Space” (Zs)

SPACING_MARK

General category “Mark, Spacing” (Mc)

SURROGATE

General category “Other, Surrogate” (Cs)

TITLECASE_LETTER

General category “Letter, Titlecase” (Lt)

UNASSIGNED

General category “Other, Not Assigned” (Cn)

UPPERCASE_LETTER

General category “Letter, Uppercase” (Lu)
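
The two-letter abbreviations in parentheses above are the Unicode General_Category values, so the correspondence between an abbreviation and a field name can be illustrated without this class at all. The following is a minimal sketch that uses Python's standard ``unicodedata`` module rather than ``UnicodeType`` itself. The ``CATEGORY_TO_FIELD`` table and the ``classify`` helper are hypothetical names introduced only for this illustration, and the result may vary slightly with the Unicode version bundled with the interpreter.

```python
import unicodedata

# Hypothetical lookup table: General_Category abbreviation -> field name
# exactly as listed in this reference. Not part of the documented API.
CATEGORY_TO_FIELD = {
    "Pe": "CLOSE_PUNCTUATION",
    "Pc": "CONNECT_PUNCTUATION",
    "Cc": "CONTROL",
    "Sc": "CURRENCY_SYMBOL",
    "Pd": "DASH_PUNCTUATION",
    "Nd": "DECIMAL_NUMBER",
    "Me": "ENCLOSING_MARK",
    "Pf": "FINAL_PUNCTUATION",
    "Cf": "FORMAT",
    "Pi": "INITIAL_PUNCTUATION",
    "Nl": "LETTER_NUMBER",
    "Zl": "LINE_SEPARATOR",
    "Ll": "LOWERCASE_LETTER",
    "Sm": "MATH_SYMBOL",
    "Lm": "MODIFIER_LETTER",
    "Sk": "MODIFIER_SYMBOL",
    "Mn": "NON_SPACING_MARK",
    "Ps": "OPEN_PUNCTUATION",
    "Lo": "OTHER_LETTER",
    "No": "OTHER_NUMBER",
    "Po": "OTHER_PUNCTUATION",
    "So": "OTHER_SYMBOL",
    "Zp": "PARAGRAPH_SEPARATOR",
    "Co": "PRIVATE_USE",
    "Zs": "SPACE_SEPARATOR",
    "Mc": "SPACING_MARK",
    "Cs": "SURROGATE",
    "Lt": "TITLECASE_LETTER",
    "Cn": "UNASSIGNED",
    "Lu": "UPPERCASE_LETTER",
}


def classify(ch: str) -> str:
    """Return the field name (as listed above) for a single character."""
    if len(ch) != 1:
        raise ValueError("expected exactly one character")
    # unicodedata.category() returns the two-letter General_Category code.
    return CATEGORY_TO_FIELD[unicodedata.category(ch)]


if __name__ == "__main__":
    for ch in ("a", "A", "7", ")", "$", " "):
        print(repr(ch), classify(ch))
```

The table is keyed on the abbreviation rather than the long name because the abbreviation is what the standard library reports, which keeps the lookup to a single dictionary access.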