java.lang
Class Character.UnicodeBlock
public static final
class
Character.UnicodeBlock
extends Character.Subset
A family of character subsets in the Unicode specification. A character
is in at most one of these blocks.
This inner class was generated automatically from
doc/unicode/Blocks-4.0.0.txt
, by some perl scripts.
This Unicode definition file can be found on the
http://www.unicode.org website.
JDK 1.5 uses Unicode version 4.0.0.
Since: 1.2
Aegean Numbers.
0x10100 - 0x1013F.
Since: 1.5
Alphabetic Presentation Forms.
0xFB00 - 0xFB4F.
Arabic.
0x0600 - 0x06FF.
Arabic Presentation Forms-A.
0xFB50 - 0xFDFF.
Arabic Presentation Forms-B.
0xFE70 - 0xFEFF.
Armenian.
0x0530 - 0x058F.
Arrows.
0x2190 - 0x21FF.
Basic Latin.
0x0000 - 0x007F.
Bengali.
0x0980 - 0x09FF.
Block Elements.
0x2580 - 0x259F.
Bopomofo.
0x3100 - 0x312F.
Bopomofo Extended.
0x31A0 - 0x31BF.
Since: 1.4
Box Drawing.
0x2500 - 0x257F.
Braille Patterns.
0x2800 - 0x28FF.
Since: 1.4
Buhid.
0x1740 - 0x175F.
Since: 1.5
Byzantine Musical Symbols.
0x1D000 - 0x1D0FF.
Since: 1.5
Cherokee.
0x13A0 - 0x13FF.
Since: 1.4
CJK Compatibility.
0x3300 - 0x33FF.
CJK Compatibility Forms.
0xFE30 - 0xFE4F.
CJK Compatibility Ideographs.
0xF900 - 0xFAFF.
CJK Compatibility Ideographs Supplement.
0x2F800 - 0x2FA1F.
Since: 1.5
CJK Radicals Supplement.
0x2E80 - 0x2EFF.
Since: 1.4
CJK Symbols and Punctuation.
0x3000 - 0x303F.
CJK Unified Ideographs.
0x4E00 - 0x9FFF.
CJK Unified Ideographs Extension A.
0x3400 - 0x4DBF.
Since: 1.4
CJK Unified Ideographs Extension B.
0x20000 - 0x2A6DF.
Since: 1.5
Combining Diacritical Marks.
0x0300 - 0x036F.
Combining Half Marks.
0xFE20 - 0xFE2F.
Combining Marks for Symbols.
0x20D0 - 0x20FF.
Control Pictures.
0x2400 - 0x243F.
Currency Symbols.
0x20A0 - 0x20CF.
Cypriot Syllabary.
0x10800 - 0x1083F.
Since: 1.5
Cyrillic.
0x0400 - 0x04FF.
Cyrillic Supplementary.
0x0500 - 0x052F.
Since: 1.5
Deseret.
0x10400 - 0x1044F.
Since: 1.5
Devanagari.
0x0900 - 0x097F.
Dingbats.
0x2700 - 0x27BF.
Enclosed Alphanumerics.
0x2460 - 0x24FF.
Enclosed CJK Letters and Months.
0x3200 - 0x32FF.
Ethiopic.
0x1200 - 0x137F.
Since: 1.4
General Punctuation.
0x2000 - 0x206F.
Geometric Shapes.
0x25A0 - 0x25FF.
Georgian.
0x10A0 - 0x10FF.
Gothic.
0x10330 - 0x1034F.
Since: 1.5
Greek.
0x0370 - 0x03FF.
Greek Extended.
0x1F00 - 0x1FFF.
Gujarati.
0x0A80 - 0x0AFF.
Gurmukhi.
0x0A00 - 0x0A7F.
Halfwidth and Fullwidth Forms.
0xFF00 - 0xFFEF.
Hangul Compatibility Jamo.
0x3130 - 0x318F.
Hangul Jamo.
0x1100 - 0x11FF.
Hangul Syllables.
0xAC00 - 0xD7AF.
Hanunoo.
0x1720 - 0x173F.
Since: 1.5
Hebrew.
0x0590 - 0x05FF.
High Private Use Surrogates.
0xDB80 - 0xDBFF.
Since: 1.5
High Surrogates.
0xD800 - 0xDB7F.
Since: 1.5
Hiragana.
0x3040 - 0x309F.
Ideographic Description Characters.
0x2FF0 - 0x2FFF.
Since: 1.4
IPA Extensions.
0x0250 - 0x02AF.
Kanbun.
0x3190 - 0x319F.
Kangxi Radicals.
0x2F00 - 0x2FDF.
Since: 1.4
Kannada.
0x0C80 - 0x0CFF.
Katakana.
0x30A0 - 0x30FF.
Katakana Phonetic Extensions.
0x31F0 - 0x31FF.
Since: 1.5
Khmer.
0x1780 - 0x17FF.
Since: 1.4
Khmer Symbols.
0x19E0 - 0x19FF.
Since: 1.5
Lao.
0x0E80 - 0x0EFF.
Latin-1 Supplement.
0x0080 - 0x00FF.
Latin Extended-A.
0x0100 - 0x017F.
Latin Extended Additional.
0x1E00 - 0x1EFF.
Latin Extended-B.
0x0180 - 0x024F.
Letterlike Symbols.
0x2100 - 0x214F.
Limbu.
0x1900 - 0x194F.
Since: 1.5
Linear B Ideograms.
0x10080 - 0x100FF.
Since: 1.5
Linear B Syllabary.
0x10000 - 0x1007F.
Since: 1.5
Low Surrogates.
0xDC00 - 0xDFFF.
Since: 1.5
Malayalam.
0x0D00 - 0x0D7F.
Mathematical Alphanumeric Symbols.
0x1D400 - 0x1D7FF.
Since: 1.5
Mathematical Operators.
0x2200 - 0x22FF.
Miscellaneous Mathematical Symbols-A.
0x27C0 - 0x27EF.
Since: 1.5
Miscellaneous Mathematical Symbols-B.
0x2980 - 0x29FF.
Since: 1.5
Miscellaneous Symbols.
0x2600 - 0x26FF.
Miscellaneous Symbols and Arrows.
0x2B00 - 0x2BFF.
Since: 1.5
Miscellaneous Technical.
0x2300 - 0x23FF.
Mongolian.
0x1800 - 0x18AF.
Since: 1.4
Musical Symbols.
0x1D100 - 0x1D1FF.
Since: 1.5
Myanmar.
0x1000 - 0x109F.
Since: 1.4
Number Forms.
0x2150 - 0x218F.
Ogham.
0x1680 - 0x169F.
Since: 1.4
Old Italic.
0x10300 - 0x1032F.
Since: 1.5
Optical Character Recognition.
0x2440 - 0x245F.
Oriya.
0x0B00 - 0x0B7F.
Osmanya.
0x10480 - 0x104AF.
Since: 1.5
Phonetic Extensions.
0x1D00 - 0x1D7F.
Since: 1.5
Private Use Area.
0xE000 - 0xF8FF.
Runic.
0x16A0 - 0x16FF.
Since: 1.4
Shavian.
0x10450 - 0x1047F.
Since: 1.5
Sinhala.
0x0D80 - 0x0DFF.
Since: 1.4
Small Form Variants.
0xFE50 - 0xFE6F.
Spacing Modifier Letters.
0x02B0 - 0x02FF.
Specials.
0xFFF0 - 0xFFFF.
Superscripts and Subscripts.
0x2070 - 0x209F.
Supplemental Arrows-A.
0x27F0 - 0x27FF.
Since: 1.5
Supplemental Arrows-B.
0x2900 - 0x297F.
Since: 1.5
Supplemental Mathematical Operators.
0x2A00 - 0x2AFF.
Since: 1.5
Supplementary Private Use Area-A.
0xF0000 - 0xFFFFF.
Since: 1.5
Supplementary Private Use Area-B.
0x100000 - 0x10FFFF.
Since: 1.5
Syriac.
0x0700 - 0x074F.
Since: 1.4
Tagalog.
0x1700 - 0x171F.
Since: 1.5
Tagbanwa.
0x1760 - 0x177F.
Since: 1.5
Tags.
0xE0000 - 0xE007F.
Since: 1.5
Tai Le.
0x1950 - 0x197F.
Since: 1.5
Tai Xuan Jing Symbols.
0x1D300 - 0x1D35F.
Since: 1.5
Tamil.
0x0B80 - 0x0BFF.
Telugu.
0x0C00 - 0x0C7F.
Thaana.
0x0780 - 0x07BF.
Since: 1.4
Thai.
0x0E00 - 0x0E7F.
Tibetan.
0x0F00 - 0x0FFF.
Ugaritic.
0x10380 - 0x1039F.
Since: 1.5
Unified Canadian Aboriginal Syllabics.
0x1400 - 0x167F.
Since: 1.4
Variation Selectors.
0xFE00 - 0xFE0F.
Since: 1.5
Variation Selectors Supplement.
0xE0100 - 0xE01EF.
Since: 1.5
Yijing Hexagram Symbols.
0x4DC0 - 0x4DFF.
Since: 1.5
Yi Radicals.
0xA490 - 0xA4CF.
Since: 1.4
Yi Syllables.
0xA000 - 0xA48F.
Since: 1.4
Returns the UnicodeBlock
with the given name, as defined
by the Unicode standard. The version of Unicode in use is defined by
the Character
class, and the names are given in the
Blocks-.txt
file corresponding to that version.
The name may be specified in one of three ways:
- The canonical, human-readable name used by the Unicode standard.
This is the name with all spaces and hyphens retained. For example,
`Basic Latin' retrieves the block, UnicodeBlock.BASIC_LATIN.
- The canonical name with all spaces removed e.g. `BasicLatin'.
- The name used for the constants specified by this class, which
is the canonical name with all spaces and hyphens replaced with
underscores e.g. `BASIC_LATIN'
The names are compared case-insensitively using the case comparison
associated with the U.S. English locale. The method recognises the
previous names used for blocks as well as the current ones. At
present, this simply means that the deprecated `SURROGATES_AREA'
will be recognised by this method (the of()
methods
only return one of the three new surrogate blocks).
Parameters: blockName the name of the block to look up.
Returns: the specified block.
Throws: NullPointerException if the blockName
is
null
. IllegalArgumentException if the name does not match any Unicode
block.
Since: 1.5
Returns the Unicode character block which a character belongs to.
Note: This method does not support the use of
supplementary characters. For such support,
of(int)
should be used instead.
Parameters: ch the character to look up
Returns: the set it belongs to, or null if it is not in one
Returns the Unicode character block which a code point belongs to.
Parameters: codePoint the character to look up
Returns: the set it belongs to, or null if it is not in one.
Throws: IllegalArgumentException if the specified code point is
invalid.
Since: 1.5