Skip to content

Commit

Permalink
Update unicode data to 15.1
Browse files Browse the repository at this point in the history
Fix #998
  • Loading branch information
wengxt committed Mar 26, 2024
1 parent e8572f9 commit f45c693
Show file tree
Hide file tree
Showing 2 changed files with 64 additions and 9 deletions.
Binary file modified src/modules/unicode/charselectdata
Binary file not shown.
73 changes: 64 additions & 9 deletions src/modules/unicode/gen.py
Original file line number Diff line number Diff line change
Expand Up @@ -91,15 +91,19 @@
sectiondata = '''
SECTION European Scripts
Armenian
Carian
Caucasian Albanian
Cypriot Syllabary
Cypro-Minoan
Cyrillic
Cyrillic Supplement
Cyrillic Extended-A
Cyrillic Extended-B
Cyrillic Extended-C
Cyrillic Extended-D
Elbasan
Georgian
Georgian Extended
Georgian Supplement
Glagolitic
Glagolitic Supplement
Expand All @@ -114,6 +118,8 @@
Latin Extended-C
Latin Extended-D
Latin Extended-E
Latin Extended-F
Latin Extended-G
Latin Extended Additional
IPA Extensions
Phonetic Extensions
Expand All @@ -122,13 +128,16 @@
Linear B Syllabary
Linear B Ideograms
Aegean Numbers
Lycian
Lydian
Ogham
Old Hungarian
Old Italic
Old Permic
Phaistos Disc
Runic
Shavian
Vithkuqi
SECTION Modifier Letters
Modifier Tone Letters
Expand All @@ -142,6 +151,10 @@
Combining Diacritical Marks for Symbols
Combining Half Marks
SECTION Miscellaneous
Alphabetic Presentation Forms
Halfwidth and Fullwidth Forms
SECTION African Scripts
Adlam
Bamum
Expand All @@ -150,10 +163,13 @@
Coptic
Coptic Epact Numbers
Egyptian Hieroglyphs
Egyptian Hieroglyph Format Controls
Ethiopic
Ethiopic Supplement
Ethiopic Extended
Ethiopic Extended-A
Ethiopic Extended-B
Medefaidrin
Mende Kikakui
Meroitic Cursive
Meroitic Hieroglyphs
Expand All @@ -167,20 +183,21 @@
Arabic
Arabic Supplement
Arabic Extended-A
Arabic Extended-B
Arabic Extended-C
Arabic Presentation Forms-A
Arabic Presentation Forms-B
Imperial Aramaic
Avestan
Carian
Chorasmian
Cuneiform
Cuneiform Numbers and Punctuation
Early Dynastic Cuneiform
Old Persian
Ugaritic
Elymaic
Hatran
Hebrew
Lycian
Lydian
Mandaic
Nabataean
Old North Arabian
Expand All @@ -192,15 +209,22 @@
Phoenician
Samaritan
Syriac
Syriac Supplement
Yezidi
SECTION Central Asian Scripts
Manichaean
Marchen
Mongolian
Mongolian Supplement
Old Sogdian
Old Turkic
Old Uyghur
Phags-pa
Sogdian
Soyombo
Tibetan
Zanabazar Square
SECTION South Asian Scripts
Ahom
Expand All @@ -210,8 +234,12 @@
Chakma
Devanagari
Devanagari Extended
Devanagari Extended-A
Dives Akuru
Dogra
Grantha
Gujarati
Gunjala Gondi
Gurmukhi
Kaithi
Kannada
Expand All @@ -222,11 +250,14 @@
Limbu
Mahajani
Malayalam
Masaram Gondi
Meetei Mayek
Meetei Mayek Extensions
Modi
Mro
Multani
Nag Mundari
Nandinagari
Newa
Ol Chiki
Oriya
Expand All @@ -239,14 +270,18 @@
Syloti Nagri
Takri
Tamil
Tamil Supplement
Telugu
Thaana
Tirhuta
Toto
Vedic Extensions
Wancho
Warang Citi
SECTION Southeast Asian Scripts
Cham
Hanifi Rohingya
Kayah Li
Khmer
Khmer Symbols
Expand All @@ -255,11 +290,13 @@
Myanmar Extended-A
Myanmar Extended-B
New Tai Lue
Nyiakeng Puachue Hmong
Pahawh Hmong
Pau Cin Hau
Tai Le
Tai Tham
Tai Viet
Tangsa
Thai
SECTION Indonesia & Oceania Scripts
Expand All @@ -269,6 +306,8 @@
Buhid
Hanunoo
Javanese
Kawi
Makasar
Rejang
Sundanese
Sundanese Supplement
Expand All @@ -284,6 +323,10 @@
CJK Unified Ideographs Extension C
CJK Unified Ideographs Extension D
CJK Unified Ideographs Extension E
CJK Unified Ideographs Extension F
CJK Unified Ideographs Extension G
CJK Unified Ideographs Extension H
CJK Unified Ideographs Extension I
CJK Compatibility Ideographs
CJK Compatibility Ideographs Supplement
Kangxi Radicals
Expand All @@ -296,14 +339,21 @@
Hangul Compatibility Jamo
Hangul Syllables
Hiragana
Katakana
Katakana Phonetic Extensions
Kana Extended-A
Kana Extended-B
Kana Supplement
Small Kana Extension
Kanbun
Katakana
Katakana Phonetic Extensions
Khitan Small Script
Lisu
Lisu Supplement
Miao
Nushu
Tangut
Tangut Components
Tangut Supplement
Yi Syllables
Yi Radicals
Expand All @@ -314,16 +364,14 @@
Osage
Unified Canadian Aboriginal Syllabics
Unified Canadian Aboriginal Syllabics Extended
SECTION Other
Alphabetic Presentation Forms
Halfwidth and Fullwidth Forms
Unified Canadian Aboriginal Syllabics Extended-A
SECTION Notational Systems
Braille Patterns
Musical Symbols
Ancient Greek Musical Notation
Byzantine Musical Symbols
Znamenny Musical Notation
Duployan
Shorthand Format Controls
Sutton SignWriting
Expand Down Expand Up @@ -358,7 +406,11 @@
Coptic Epact Numbers
Counting Rod Numerals
Cuneiform Numbers and Punctuation
Indic Siyaq Numbers
Kaktovik Numerals
Mayan Numerals
Number Forms
Ottoman Siyaq Numbers
Rumi Numeral Symbols
Sinhala Archaic Numbers
Expand Down Expand Up @@ -387,16 +439,19 @@
Miscellaneous Symbols
Miscellaneous Symbols and Pictographs
Supplemental Symbols and Pictographs
Symbols and Pictographs Extended-A
Transport and Map Symbols
SECTION Other Symbols
Alchemical Symbols
Ancient Symbols
Currency Symbols
Chess Symbols
Domino Tiles
Mahjong Tiles
Playing Cards
Miscellaneous Symbols and Arrows
Symbols for Legacy Computing
Yijing Hexagram Symbols
Tai Xuan Jing Symbols
Expand Down

0 comments on commit f45c693

Please sign in to comment.