Python unicodedata library is_normalized unidata_version and ucd_3_2_0















































Python Unicode data:
The 'unicodedata' module in Python provides access to the Unicode Character Database (UCD), which defines the properties of all Unicode characters.
UCD file link: https://www.unicode.org/Public/9.0.0/ucd/UnicodeData.txt

The next function defined here is:
1. unicodedata.is_normalized(form, unistr)

The unicodedata module also provides the following attribute and object:
1. unicodedata.unidata_version
2. unicodedata.ucd_3_2_0

Let us discuss these in detail.

unicodedata.is_normalized(form, unistr):
As explained in the previous article, Unicode normalization resolves canonical and compatibility equivalences, so that equivalent strings end up with the same representation.
As stated before, there are two decomposition forms and two composition forms:
Decomposition:
1. NFD - Canonical Decomposition
2. NFKD - Compatibility Decomposition
Composition:
1. NFC - Canonical Decomposition followed by Canonical Composition
2. NFKC - Compatibility Decomposition followed by Canonical Composition
While the unicodedata.normalize(form, unistr) function converts the string 'unistr' to the normal form 'form' given as its first argument, unicodedata.is_normalized(form, unistr) tells us whether the string 'unistr' is already in that normal form. It is available in Python 3.8 and later.
The first parameter 'form' can take the values: 'NFD', 'NFC', 'NFKD', 'NFKC'.
The second parameter is the string to be checked against that normal form.
The function returns True or False accordingly.

EXAMPLE-1:
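A minimal sketch of the canonical forms, assuming Python 3.8 or later. The sample strings (two ways of writing "é") and the variable names are chosen here for illustration only:

```python
import unicodedata

# The letter "é" written two ways:
composed = "\u00e9"     # one precomposed code point
decomposed = "e\u0301"  # "e" followed by a combining acute accent

# NFC expects the composed spelling, NFD the decomposed one.
print(unicodedata.is_normalized("NFC", composed))    # True
print(unicodedata.is_normalized("NFC", decomposed))  # False
print(unicodedata.is_normalized("NFD", composed))    # False
print(unicodedata.is_normalized("NFD", decomposed))  # True
```

Both strings display the same character, but only one of them is in each normal form.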



EXAMPLE-2:
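A second sketch, this time for the compatibility forms, again assuming Python 3.8 or later. The ligature character and the names 'ligature' and 'results' are illustrative choices, not part of the module:

```python
import unicodedata

ligature = "\ufb01"  # LATIN SMALL LIGATURE FI, a compatibility character

results = {}
for form in ("NFC", "NFD", "NFKC", "NFKD"):
    # is_normalized answers the same question as comparing the string
    # with its normalized copy, without building that copy ourselves.
    results[form] = unicodedata.is_normalized(form, ligature)
    same = unicodedata.normalize(form, ligature) == ligature
    print(form, results[form], same)

# After normalizing, the check succeeds.
fixed = unicodedata.normalize("NFKC", ligature)
print(fixed, unicodedata.is_normalized("NFKC", fixed))  # fi True
```

The ligature has only a compatibility decomposition, so it is already in NFC and NFD but not in NFKC or NFKD.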


Module Attributes and Objects:

1. unicodedata.unidata_version

This is an attribute of the unicodedata module that holds, as a string, the version of the Unicode database used by the module.
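A short sketch of reading the attribute. The exact value depends on the Python release in use, so no particular output is assumed; the names 'version' and 'parts' are illustrative:

```python
import unicodedata

version = unicodedata.unidata_version  # a string such as "X.Y.Z"
print("Unicode database version:", version)

# Split into numbers when a comparison is needed.
parts = tuple(int(p) for p in version.split("."))
print(parts >= (3, 2, 0))
```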



2. unicodedata.ucd_3_2_0

This is an object in the unicodedata module that offers the same methods as the module itself, but uses Unicode database version 3.2 instead, for applications that require this specific version of the Unicode database (such as IDNA).
It provides access to the version 3.2 data so that the module stays compatible with older IDNA applications.
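A minimal sketch comparing the two views. The test character (the Indian rupee sign, which was added to Unicode after version 3.2) and the placeholder string "<unassigned>" are illustrative choices:

```python
import unicodedata

old = unicodedata.ucd_3_2_0
print(old.unidata_version)  # 3.2.0

rupee = "\u20b9"  # INDIAN RUPEE SIGN, not assigned in Unicode 3.2

# The current database knows the character ...
name_new = unicodedata.name(rupee, "<unassigned>")
# ... while the 3.2 view falls back to the default value passed in.
name_old = old.name(rupee, "<unassigned>")

print(name_new)
print(name_old)
```

The same method call gives different answers depending on which database version it is made against.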





