java.lang
public final class Character extends Object implements Serializable, Comparable<Character>
For predicates, boundaries are used to describe the set of characters for which the method will return true. This syntax uses fairly normal regular expression notation. See 5.13 of the Unicode Standard, Version 4.0, for the boundary specification.
See http://www.unicode.org for more information on the Unicode Standard.
Since: 1.0
See Also: CharData
UNKNOWN: partly updated to 1.5; some things still missing
Nested Class Summary | |
---|---|
static class | Character.Subset
A subset of Unicode blocks.
|
static class | Character.UnicodeBlock
A family of character subsets in the Unicode specification. |
Field Summary | |
---|---|
static byte | COMBINING_SPACING_MARK
Mc = Mark, Spacing Combining (Normative).
|
static byte | CONNECTOR_PUNCTUATION
Pc = Punctuation, Connector (Informative).
|
static byte | CONTROL
Cc = Other, Control (Normative).
|
static byte | CURRENCY_SYMBOL
Sc = Symbol, Currency (Informative).
|
static byte | DASH_PUNCTUATION
Pd = Punctuation, Dash (Informative).
|
static byte | DECIMAL_DIGIT_NUMBER
Nd = Number, Decimal Digit (Normative).
|
static byte | DIRECTIONALITY_ARABIC_NUMBER
Weak bidirectional character type "AN".
|
static byte | DIRECTIONALITY_BOUNDARY_NEUTRAL
Weak bidirectional character type "BN".
|
static byte | DIRECTIONALITY_COMMON_NUMBER_SEPARATOR
Weak bidirectional character type "CS".
|
static byte | DIRECTIONALITY_EUROPEAN_NUMBER
Weak bidirectional character type "EN".
|
static byte | DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR
Weak bidirectional character type "ES".
|
static byte | DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR
Weak bidirectional character type "ET".
|
static byte | DIRECTIONALITY_LEFT_TO_RIGHT
Strong bidirectional character type "L".
|
static byte | DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING
Strong bidirectional character type "LRE".
|
static byte | DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE
Strong bidirectional character type "LRO".
|
static byte | DIRECTIONALITY_NONSPACING_MARK
Weak bidirectional character type "NSM".
|
static byte | DIRECTIONALITY_OTHER_NEUTRALS
Neutral bidirectional character type "ON".
|
static byte | DIRECTIONALITY_PARAGRAPH_SEPARATOR
Neutral bidirectional character type "B".
|
static byte | DIRECTIONALITY_POP_DIRECTIONAL_FORMAT
Weak bidirectional character type "PDF".
|
static byte | DIRECTIONALITY_RIGHT_TO_LEFT
Strong bidirectional character type "R".
|
static byte | DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC
Strong bidirectional character type "AL".
|
static byte | DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING
Strong bidirectional character type "RLE".
|
static byte | DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE
Strong bidirectional character type "RLO".
|
static byte | DIRECTIONALITY_SEGMENT_SEPARATOR
Neutral bidirectional character type "S".
|
static byte | DIRECTIONALITY_UNDEFINED
Undefined bidirectional character type. |
static byte | DIRECTIONALITY_WHITESPACE
Neutral bidirectional character type "WS".
|
static byte | ENCLOSING_MARK
Me = Mark, Enclosing (Normative).
|
static byte | END_PUNCTUATION
Pe = Punctuation, Close (Informative).
|
static byte | FINAL_QUOTE_PUNCTUATION
Pf = Punctuation, Final Quote (Informative).
|
static byte | FORMAT
Cf = Other, Format (Normative).
|
static byte | INITIAL_QUOTE_PUNCTUATION
Pi = Punctuation, Initial Quote (Informative).
|
static byte | LETTER_NUMBER
Nl = Number, Letter (Normative).
|
static byte | LINE_SEPARATOR
Zl = Separator, Line (Normative).
|
static byte | LOWERCASE_LETTER
Ll = Letter, Lowercase (Informative).
|
static byte | MATH_SYMBOL
Sm = Symbol, Math (Informative).
|
static int | MAX_CODE_POINT
The maximum Unicode 4.0 code point, which is greater than the range
of the char data type.
|
static char | MAX_HIGH_SURROGATE
The maximum Unicode high surrogate code unit, or leading-surrogate, in the UTF-16 character encoding.
|
static char | MAX_LOW_SURROGATE
The maximum Unicode low surrogate code unit, or trailing-surrogate, in the UTF-16 character encoding.
|
static int | MAX_RADIX
Largest value allowed for radix arguments in Java. |
static char | MAX_SURROGATE
The maximum Unicode surrogate code unit in the UTF-16 character encoding.
|
static char | MAX_VALUE
The maximum value the char data type can hold.
|
static int | MIN_CODE_POINT
The minimum Unicode 4.0 code point. |
static char | MIN_HIGH_SURROGATE
The minimum Unicode high surrogate code unit, or leading-surrogate, in the UTF-16 character encoding.
|
static char | MIN_LOW_SURROGATE
The minimum Unicode low surrogate code unit, or trailing-surrogate, in the UTF-16 character encoding.
|
static int | MIN_RADIX
Smallest value allowed for radix arguments in Java. |
static int | MIN_SUPPLEMENTARY_CODE_POINT
The lowest possible supplementary Unicode code point (the first code
point outside the basic multilingual plane (BMP)).
|
static char | MIN_SURROGATE
The minimum Unicode surrogate code unit in the UTF-16 character encoding.
|
static char | MIN_VALUE
The minimum value the char data type can hold.
|
static byte | MODIFIER_LETTER
Lm = Letter, Modifier (Informative).
|
static byte | MODIFIER_SYMBOL
Sk = Symbol, Modifier (Informative).
|
static byte | NON_SPACING_MARK
Mn = Mark, Non-Spacing (Normative).
|
static byte | OTHER_LETTER
Lo = Letter, Other (Informative).
|
static byte | OTHER_NUMBER
No = Number, Other (Normative).
|
static byte | OTHER_PUNCTUATION
Po = Punctuation, Other (Informative).
|
static byte | OTHER_SYMBOL
So = Symbol, Other (Informative).
|
static byte | PARAGRAPH_SEPARATOR
Zp = Separator, Paragraph (Normative).
|
static byte | PRIVATE_USE
Co = Other, Private Use (Normative).
|
static int | SIZE
The number of bits needed to represent a char . |
static byte | SPACE_SEPARATOR
Zs = Separator, Space (Normative).
|
static byte | START_PUNCTUATION
Ps = Punctuation, Open (Informative).
|
static byte | SURROGATE
Cs = Other, Surrogate (Normative).
|
static byte | TITLECASE_LETTER
Lt = Letter, Titlecase (Informative).
|
static Class<Character> | TYPE
Class object representing the primitive char data type.
|
static byte | UNASSIGNED
Cn = Other, Not Assigned (Normative).
|
static byte | UPPERCASE_LETTER
Lu = Letter, Uppercase (Informative).
|
Constructor Summary | |
---|---|
Character(char value)
Wraps up a character.
|
Method Summary | |
---|---|
static int | charCount(int codePoint)
Returns the number of 16-bit chars required to represent the given
code point.
|
char | charValue()
Returns the character which has been wrapped by this class.
|
static int | codePointAt(CharSequence sequence, int index)
Get the code point at the specified index in the CharSequence.
|
static int | codePointAt(char[] chars, int index)
Get the code point at the specified index in the char array.
|
static int | codePointAt(char[] chars, int index, int limit)
Get the code point at the specified index in the char array, examining no characters at or past the limit.
|
static int | codePointBefore(char[] chars, int index)
Get the code point before the specified index. |
static int | codePointBefore(char[] chars, int index, int start)
Get the code point before the specified index. |
static int | codePointBefore(CharSequence sequence, int index)
Get the code point before the specified index. |
static int | codePointCount(CharSequence seq, int beginIndex, int endIndex)
Returns the number of Unicode code points in the specified range of the
given CharSequence. |
static int | codePointCount(char[] a, int offset, int count)
Returns the number of Unicode code points in the specified range of the
given char array. |
int | compareTo(Character anotherCharacter)
Compares another Character to this Character, numerically.
|
static int | digit(char ch, int radix)
Converts a character into a digit of the specified radix. |
static int | digit(int codePoint, int radix)
Converts a character into a digit of the specified radix. |
boolean | equals(Object o)
Determines if an object is equal to this object. |
static char | forDigit(int digit, int radix)
Converts a digit into a character which represents that digit
in a specified radix. |
static byte | getDirectionality(char ch)
Returns the Unicode directionality property of the character. |
static byte | getDirectionality(int codePoint)
Returns the Unicode directionality property of the character. |
static int | getNumericValue(char ch)
Returns the Unicode numeric value property of a character. |
static int | getNumericValue(int codePoint)
Returns the Unicode numeric value property of a character. |
static int | getType(char ch)
Returns the Unicode general category property of a character.
|
static int | getType(int codePoint)
Returns the Unicode general category property of a character.
|
int | hashCode()
Returns the numerical value (unsigned) of the wrapped character.
|
static boolean | isDefined(char ch)
Determines if a character is part of the Unicode Standard. |
static boolean | isDefined(int codePoint)
Determines if a character is part of the Unicode Standard. |
static boolean | isDigit(char ch)
Determines if a character is a Unicode decimal digit. |
static boolean | isDigit(int codePoint)
Determines if a character is a Unicode decimal digit. |
static boolean | isHighSurrogate(char ch)
Return true if the given character is a high surrogate. |
static boolean | isIdentifierIgnorable(char ch)
Determines if a character is ignorable in a Unicode identifier. |
static boolean | isIdentifierIgnorable(int codePoint)
Determines if a character is ignorable in a Unicode identifier. |
static boolean | isISOControl(char ch)
Determines if a character has the ISO Control property.
|
static boolean | isISOControl(int codePoint)
Determines if the character is an ISO Control character. |
static boolean | isJavaIdentifierPart(char ch)
Determines if a character can follow the first letter in
a Java identifier. |
static boolean | isJavaIdentifierPart(int codePoint)
Determines if a character can follow the first letter in
a Java identifier. |
static boolean | isJavaIdentifierStart(char ch)
Determines if a character can start a Java identifier. |
static boolean | isJavaIdentifierStart(int codePoint)
Determines if a character can start a Java identifier. |
static boolean | isJavaLetter(char ch)
Determines if a character can start a Java identifier. |
static boolean | isJavaLetterOrDigit(char ch)
Determines if a character can follow the first letter in
a Java identifier. |
static boolean | isLetter(char ch)
Determines if a character is a Unicode letter. |
static boolean | isLetter(int codePoint)
Determines if a character is a Unicode letter. |
static boolean | isLetterOrDigit(char ch)
Determines if a character is a Unicode letter or a Unicode digit. |
static boolean | isLetterOrDigit(int codePoint)
Determines if a character is a Unicode letter or a Unicode digit. |
static boolean | isLowerCase(char ch)
Determines if a character is a Unicode lowercase letter. |
static boolean | isLowerCase(int codePoint)
Determines if a character is a Unicode lowercase letter. |
static boolean | isLowSurrogate(char ch)
Return true if the given character is a low surrogate. |
static boolean | isMirrored(char ch)
Determines whether the character is mirrored according to Unicode. |
static boolean | isMirrored(int codePoint)
Determines whether the character is mirrored according to Unicode. |
static boolean | isSpace(char ch)
Determines if a character is an ISO-LATIN-1 space. |
static boolean | isSpaceChar(char ch)
Determines if a character is a Unicode space character. |
static boolean | isSpaceChar(int codePoint)
Determines if a character is a Unicode space character. |
static boolean | isSupplementaryCodePoint(int codePoint)
Determines whether the specified code point is
in the range 0x10000 .. 0x10FFFF. |
static boolean | isSurrogatePair(char ch1, char ch2)
Return true if the given characters compose a surrogate pair.
|
static boolean | isTitleCase(char ch)
Determines if a character is a Unicode titlecase letter. |
static boolean | isTitleCase(int codePoint)
Determines if a character is a Unicode titlecase letter. |
static boolean | isUnicodeIdentifierPart(char ch)
Determines if a character can follow the first letter in
a Unicode identifier. |
static boolean | isUnicodeIdentifierPart(int codePoint)
Determines if a character can follow the first letter in
a Unicode identifier. |
static boolean | isUnicodeIdentifierStart(char ch)
Determines if a character can start a Unicode identifier. |
static boolean | isUnicodeIdentifierStart(int codePoint)
Determines if a character can start a Unicode identifier. |
static boolean | isUpperCase(char ch)
Determines if a character is a Unicode uppercase letter. |
static boolean | isUpperCase(int codePoint)
Determines if a character is a Unicode uppercase letter. |
static boolean | isValidCodePoint(int codePoint)
Determines whether the specified code point is
in the range 0x0000 .. 0x10FFFF. |
static boolean | isWhitespace(char ch)
Determines if a character is Java whitespace. |
static boolean | isWhitespace(int codePoint)
Determines if a character is Java whitespace. |
static int | offsetByCodePoints(CharSequence seq, int index, int codePointOffset)
Returns the index into the given CharSequence that is offset
codePointOffset code points from index . |
static int | offsetByCodePoints(char[] a, int start, int count, int index, int codePointOffset)
Returns the index into the given char subarray that is offset
codePointOffset code points from index . |
static char | reverseBytes(char val)
Reverse the bytes in val. |
static char[] | toChars(int codePoint)
Converts a Unicode code point to a UTF-16 representation of that
code point.
|
static int | toChars(int codePoint, char[] dst, int dstIndex)
Converts a Unicode code point to its UTF-16 representation.
|
static int | toCodePoint(char high, char low)
Given a valid surrogate pair, this returns the corresponding
code point. |
static char | toLowerCase(char ch)
Converts a Unicode character into its lowercase equivalent mapping.
|
static int | toLowerCase(int codePoint)
Converts a Unicode character into its lowercase equivalent mapping.
|
String | toString()
Converts the wrapped character into a String.
|
static String | toString(char ch)
Returns a String of length 1 representing the specified character.
|
static char | toTitleCase(char ch)
Converts a Unicode character into its titlecase equivalent mapping.
|
static int | toTitleCase(int codePoint)
Converts a Unicode character into its titlecase equivalent mapping.
|
static char | toUpperCase(char ch)
Converts a Unicode character into its uppercase equivalent mapping.
|
static int | toUpperCase(int codePoint)
Converts a Unicode character into its uppercase equivalent mapping.
|
static Character | valueOf(char val)
Returns a Character object wrapping the value.
|
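Field Detail | |
---|---|
COMBINING_SPACING_MARK, CONNECTOR_PUNCTUATION, CONTROL, CURRENCY_SYMBOL, DASH_PUNCTUATION, DECIMAL_DIGIT_NUMBER | Since: 1.1 |
DIRECTIONALITY_ARABIC_NUMBER through DIRECTIONALITY_WHITESPACE (all 20 DIRECTIONALITY_ constants) | Since: 1.4 |
ENCLOSING_MARK, END_PUNCTUATION | Since: 1.1 |
FINAL_QUOTE_PUNCTUATION | Since: 1.4 |
FORMAT | Since: 1.1 |
INITIAL_QUOTE_PUNCTUATION | Since: 1.4 |
LETTER_NUMBER, LINE_SEPARATOR, LOWERCASE_LETTER, MATH_SYMBOL | Since: 1.1 |
MAX_CODE_POINT | Value: 0x10FFFF. Since: 1.5 |
MAX_HIGH_SURROGATE | Value: '\uDBFF'. Since: 1.5 |
MAX_LOW_SURROGATE | Value: '\uDFFF'. Since: 1.5 |
MAX_SURROGATE | Value: '\uDFFF'. Since: 1.5 |
MAX_VALUE | Value: '\uFFFF'. |
MIN_CODE_POINT | Value: 0. Since: 1.5 |
MIN_HIGH_SURROGATE | Value: '\uD800'. Since: 1.5 |
MIN_LOW_SURROGATE | Value: '\uDC00'. Since: 1.5 |
MIN_SUPPLEMENTARY_CODE_POINT | Value: 0x10000. |
MIN_SURROGATE | Value: '\uD800'. Since: 1.5 |
MIN_VALUE | Value: '\u0000'. |
MODIFIER_LETTER, MODIFIER_SYMBOL, NON_SPACING_MARK, OTHER_LETTER, OTHER_NUMBER, OTHER_PUNCTUATION, OTHER_SYMBOL, PARAGRAPH_SEPARATOR, PRIVATE_USE | Since: 1.1 |
SIZE | The number of bits needed to represent a char. Since: 1.5 |
SPACE_SEPARATOR, START_PUNCTUATION, SURROGATE, TITLECASE_LETTER, TYPE, UNASSIGNED, UPPERCASE_LETTER | Since: 1.1 |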
Parameters: value the character to wrap
Parameters: codePoint a unicode code point
Returns: 2 if codePoint >= 0x10000, 1 otherwise.
Since: 1.5
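As a small illustration of the return values described for charCount, the sketch below sums the UTF-16 length of a list of code points. The class name `CharCountDemo` and the helper `utf16Length` are hypothetical names used only for this example.

```java
final class CharCountDemo {
    /** Number of UTF-16 code units needed to encode every code point in cps. */
    static int utf16Length(int[] cps) {
        int n = 0;
        for (int cp : cps) {
            // 1 for code points below 0x10000, 2 for supplementary code points
            n += Character.charCount(cp);
        }
        return n;
    }

    public static void main(String[] args) {
        System.out.println(Character.charCount('A'));      // a BMP code point
        System.out.println(Character.charCount(0x1F600));  // a supplementary code point
        System.out.println(utf16Length(new int[] {'A', 0x1F600}));
    }
}
```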
Returns: the character wrapped
Parameters: sequence the CharSequence index the index of the codepoint to get, starting at 0
Returns: the codepoint at the specified index
Throws: IndexOutOfBoundsException if index is negative or >= length()
Since: 1.5
Parameters: chars the character array in which to look index the index of the codepoint to get, starting at 0
Returns: the codepoint at the specified index
Throws: IndexOutOfBoundsException if index is negative or >= length()
Since: 1.5
Parameters: chars the character array in which to look index the index of the codepoint to get, starting at 0 limit the limit past which characters should not be examined
Returns: the codepoint at the specified index
Throws: IndexOutOfBoundsException if index is negative or >= limit, or if limit is negative or >= the length of the array
Since: 1.5
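One common use of codePointAt is walking a sequence one code point at a time instead of one char at a time. The following is a minimal sketch of that idiom; `CodePointWalk` and `codePoints` are hypothetical names, not part of the class.

```java
import java.util.ArrayList;
import java.util.List;

final class CodePointWalk {
    /** Collects the code points of s in order, pairing surrogates where possible. */
    static List<Integer> codePoints(CharSequence s) {
        List<Integer> out = new ArrayList<>();
        int i = 0;
        while (i < s.length()) {
            int cp = Character.codePointAt(s, i);
            out.add(cp);
            // Advance by 1 or 2 chars depending on how the code point is encoded.
            i += Character.charCount(cp);
        }
        return out;
    }
}
```

Advancing by `Character.charCount(cp)` rather than by 1 is what keeps the index from landing in the middle of a surrogate pair.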
Checks the characters at index-1 and index-2 to see if they form a supplementary code point. If they do not, the character at index-1 is returned.
Parameters: chars the character array index the index just past the codepoint to get, starting at 0
Returns: the codepoint before the specified index
Throws: IndexOutOfBoundsException if index is negative or >= length()
Since: 1.5
Checks the characters at index-1 and index-2 to see if they form a supplementary code point. If they do not, the character at index-1 is returned. The start parameter is used to limit the range of the array which may be examined.
Parameters: chars the character array index the index just past the codepoint to get, starting at 0 start the index before which characters should not be examined
Returns: the codepoint before the specified index
Throws: IndexOutOfBoundsException if index is < start or > the length of the array, or if start is negative or >= the length of the array
Since: 1.5
Checks the characters at index-1 and index-2 to see if they form a supplementary code point. If they do not, the character at index-1 is returned.
Parameters: sequence the CharSequence index the index just past the codepoint to get, starting at 0
Returns: the codepoint before the specified index
Throws: IndexOutOfBoundsException if index is negative or >= length()
Since: 1.5
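codePointBefore is the natural building block for walking a sequence backwards. The sketch below reverses a string by code point so that surrogate pairs stay in the right order; `ReverseByCodePoint` and `reverse` are hypothetical names chosen for the example.

```java
final class ReverseByCodePoint {
    /** Returns s with its code points in reverse order. */
    static String reverse(CharSequence s) {
        StringBuilder sb = new StringBuilder(s.length());
        int i = s.length();
        while (i > 0) {
            // Looks at index-1 (and index-2 when index-1 is a low surrogate).
            int cp = Character.codePointBefore(s, i);
            sb.appendCodePoint(cp);
            i -= Character.charCount(cp);
        }
        return sb.toString();
    }
}
```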
Parameters: seq the CharSequence to inspect beginIndex the beginning of the range endIndex the end of the range
Returns: the number of Unicode code points in the given range of the sequence
Throws: NullPointerException if seq is null IndexOutOfBoundsException if beginIndex is negative, endIndex is larger than the length of seq, or if beginIndex is greater than endIndex.
Since: 1.5
Parameters: a the char array to inspect offset the beginning of the range count the length of the range
Returns: the number of Unicode code points in the given range of the array
Throws: NullPointerException if a is null IndexOutOfBoundsException if offset or count is negative or if offset + count is larger than the length of a.
Since: 1.5
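To make the difference between a char count and a code point count concrete, here is a short sketch that wraps both codePointCount overloads. `CodePointCountDemo` and `count` are hypothetical names.

```java
final class CodePointCountDemo {
    /** Code points in the whole string. */
    static int count(String s) {
        return Character.codePointCount(s, 0, s.length());
    }

    /** Code points in the whole array; note the (offset, count) argument pair. */
    static int count(char[] a) {
        return Character.codePointCount(a, 0, a.length);
    }
}
```

For a string holding one BMP letter followed by one surrogate pair, `length()` is 3 while the code point count is 2.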
Parameters: anotherCharacter Character to compare with this Character
Returns: a negative integer if this Character is less than anotherCharacter, zero if this Character is equal, and a positive integer if this Character is greater
Throws: NullPointerException if anotherCharacter is null
Since: 1.2
Parameters: ch character to convert into a digit radix radix in which ch is a digit
Returns: digit which ch represents in radix, or -1 if ch is not a valid digit
Parameters: codePoint character to convert into a digit radix radix in which codePoint is a digit
Returns: digit which codePoint represents in radix, or -1 if codePoint is not a valid digit
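The -1 return value makes digit convenient for hand-written number parsing. The following sketch parses a non-negative integer in a given radix; `RadixParse` and `parse` are hypothetical names, and the overflow handling via `Math.multiplyExact` and `Math.addExact` is a choice made for this example, not something the class prescribes.

```java
final class RadixParse {
    /** Parses an unsigned integer written in the given radix. */
    static int parse(String s, int radix) {
        if (s.isEmpty()) {
            throw new NumberFormatException("empty input");
        }
        int value = 0;
        for (int i = 0; i < s.length(); i++) {
            int d = Character.digit(s.charAt(i), radix);
            if (d < 0) { // not a digit in this radix, or the radix itself is invalid
                throw new NumberFormatException("bad digit at index " + i);
            }
            // The exact-arithmetic helpers throw ArithmeticException on int overflow.
            value = Math.addExact(Math.multiplyExact(value, radix), d);
        }
        return value;
    }
}
```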
Parameters: o object to compare
Returns: true if o is a Character with the same value
If the radix is outside the range MIN_RADIX to MAX_RADIX, or the digit is not valid in that radix, '\0' is returned. Otherwise the return value is in '0'-'9' and 'a'-'z'.
Parameters: digit digit to be converted into a character radix radix of digit
Returns: character representing digit in radix, or '\0'
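forDigit is the inverse direction of digit, so it can drive a simple number formatter. This is a minimal sketch under the assumption that only non-negative values are formatted; `RadixFormat` and `format` are hypothetical names.

```java
final class RadixFormat {
    /** Formats a non-negative value in the given radix using lowercase digits. */
    static String format(int value, int radix) {
        if (value < 0) {
            throw new IllegalArgumentException("negative value");
        }
        if (radix < Character.MIN_RADIX || radix > Character.MAX_RADIX) {
            throw new IllegalArgumentException("radix out of range");
        }
        if (value == 0) {
            return "0";
        }
        StringBuilder sb = new StringBuilder();
        while (value > 0) {
            // value % radix is always a valid digit here, so '\0' is never appended.
            sb.append(Character.forDigit(value % radix, radix));
            value /= radix;
        }
        return sb.reverse().toString(); // digits were produced least significant first
    }
}
```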
Parameters: ch the character to look up
Returns: the directionality constant, or DIRECTIONALITY_UNDEFINED
Since: 1.4
See Also: DIRECTIONALITY_UNDEFINED DIRECTIONALITY_LEFT_TO_RIGHT DIRECTIONALITY_RIGHT_TO_LEFT DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC DIRECTIONALITY_EUROPEAN_NUMBER DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR DIRECTIONALITY_ARABIC_NUMBER DIRECTIONALITY_COMMON_NUMBER_SEPARATOR DIRECTIONALITY_NONSPACING_MARK DIRECTIONALITY_BOUNDARY_NEUTRAL DIRECTIONALITY_PARAGRAPH_SEPARATOR DIRECTIONALITY_SEGMENT_SEPARATOR DIRECTIONALITY_WHITESPACE DIRECTIONALITY_OTHER_NEUTRALS DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE DIRECTIONALITY_POP_DIRECTIONAL_FORMAT
Parameters: codePoint the character to look up
Returns: the directionality constant, or DIRECTIONALITY_UNDEFINED
Since: 1.5
See Also: DIRECTIONALITY_UNDEFINED DIRECTIONALITY_LEFT_TO_RIGHT DIRECTIONALITY_RIGHT_TO_LEFT DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC DIRECTIONALITY_EUROPEAN_NUMBER DIRECTIONALITY_EUROPEAN_NUMBER_SEPARATOR DIRECTIONALITY_EUROPEAN_NUMBER_TERMINATOR DIRECTIONALITY_ARABIC_NUMBER DIRECTIONALITY_COMMON_NUMBER_SEPARATOR DIRECTIONALITY_NONSPACING_MARK DIRECTIONALITY_BOUNDARY_NEUTRAL DIRECTIONALITY_PARAGRAPH_SEPARATOR DIRECTIONALITY_SEGMENT_SEPARATOR DIRECTIONALITY_WHITESPACE DIRECTIONALITY_OTHER_NEUTRALS DIRECTIONALITY_LEFT_TO_RIGHT_EMBEDDING DIRECTIONALITY_LEFT_TO_RIGHT_OVERRIDE DIRECTIONALITY_RIGHT_TO_LEFT_EMBEDDING DIRECTIONALITY_RIGHT_TO_LEFT_OVERRIDE DIRECTIONALITY_POP_DIRECTIONAL_FORMAT
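A typical use of getDirectionality is a quick check for right-to-left content before doing any layout work. The sketch below treats a string as containing right-to-left text if any code point has a strong R or AL type; `BidiDemo` and `containsRightToLeft` are hypothetical names, and this is a simplification rather than the Unicode bidirectional algorithm.

```java
final class BidiDemo {
    /** True if any code point in s has a strong right-to-left directionality. */
    static boolean containsRightToLeft(CharSequence s) {
        for (int i = 0; i < s.length(); ) {
            int cp = Character.codePointAt(s, i);
            byte d = Character.getDirectionality(cp);
            if (d == Character.DIRECTIONALITY_RIGHT_TO_LEFT
                    || d == Character.DIRECTIONALITY_RIGHT_TO_LEFT_ARABIC) {
                return true;
            }
            i += Character.charCount(cp);
        }
        return false;
    }
}
```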
For example, '\u216C' (the Roman numeral fifty) returns 50.
This method also returns values for the letters A through Z (not specified by Unicode) in these ranges: 'A' through 'Z' (uppercase); 'a' through 'z' (lowercase); and U+FF21 through U+FF3A, U+FF41 through U+FF5A (full width variants).
If the character lacks a numeric value property, -1 is returned. If the character has a numeric value property which is not representable as a nonnegative integer, such as a fraction, -2 is returned.
character argument boundary = [Nd]|[Nl]|[No]|U+0041-U+005A|U+0061-U+007A|U+FF21-U+FF3A|U+FF41-U+FF5A
Parameters: ch character from which the numeric value property will be retrieved
Returns: the numeric value property of ch, or -1 if it does not exist, or -2 if it is not representable as a nonnegative integer
Since: 1.1
For example, '\u216C' (the Roman numeral fifty) returns 50.
This method also returns values for the letters A through Z (not specified by Unicode) in these ranges: 'A' through 'Z' (uppercase); 'a' through 'z' (lowercase); and U+FF21 through U+FF3A, U+FF41 through U+FF5A (full width variants).
If the character lacks a numeric value property, -1 is returned. If the character has a numeric value property which is not representable as a nonnegative integer, such as a fraction, -2 is returned.
character argument boundary = [Nd]|[Nl]|[No]|U+0041-U+005A|U+0061-U+007A|U+FF21-U+FF3A|U+FF41-U+FF5A
Parameters: codePoint character from which the numeric value property will be retrieved
Returns: the numeric value property of codePoint, or -1 if it does not exist, or -2 if it is not representable as a nonnegative integer
Since: 1.5
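The three kinds of result from getNumericValue (a value, -1, or -2) are easy to mix up, so a small sketch may help. `NumericValueDemo` and `describe` are hypothetical names used only here.

```java
final class NumericValueDemo {
    /** Turns the getNumericValue result for cp into a readable description. */
    static String describe(int cp) {
        int v = Character.getNumericValue(cp);
        if (v == -1) {
            return "none";                       // no numeric value property
        }
        if (v == -2) {
            return "not a nonnegative integer";  // for example a fraction
        }
        return Integer.toString(v);
    }
}
```

Note that the letters a through z map to 10 through 35, which is why this method should not be used on its own to test whether a character is a decimal digit.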
Parameters: ch character from which the general category property will be retrieved
Returns: the character category property of ch as an integer
Since: 1.1
See Also: UNASSIGNED UPPERCASE_LETTER LOWERCASE_LETTER TITLECASE_LETTER MODIFIER_LETTER OTHER_LETTER NON_SPACING_MARK ENCLOSING_MARK COMBINING_SPACING_MARK DECIMAL_DIGIT_NUMBER LETTER_NUMBER OTHER_NUMBER SPACE_SEPARATOR LINE_SEPARATOR PARAGRAPH_SEPARATOR CONTROL FORMAT PRIVATE_USE SURROGATE DASH_PUNCTUATION START_PUNCTUATION END_PUNCTUATION CONNECTOR_PUNCTUATION OTHER_PUNCTUATION MATH_SYMBOL CURRENCY_SYMBOL MODIFIER_SYMBOL INITIAL_QUOTE_PUNCTUATION FINAL_QUOTE_PUNCTUATION
Parameters: codePoint character from which the general category property will be retrieved
Returns: the character category property of codePoint as an integer
Since: 1.5
See Also: UNASSIGNED UPPERCASE_LETTER LOWERCASE_LETTER TITLECASE_LETTER MODIFIER_LETTER OTHER_LETTER NON_SPACING_MARK ENCLOSING_MARK COMBINING_SPACING_MARK DECIMAL_DIGIT_NUMBER LETTER_NUMBER OTHER_NUMBER SPACE_SEPARATOR LINE_SEPARATOR PARAGRAPH_SEPARATOR CONTROL FORMAT PRIVATE_USE SURROGATE DASH_PUNCTUATION START_PUNCTUATION END_PUNCTUATION CONNECTOR_PUNCTUATION OTHER_PUNCTUATION MATH_SYMBOL CURRENCY_SYMBOL MODIFIER_SYMBOL INITIAL_QUOTE_PUNCTUATION FINAL_QUOTE_PUNCTUATION
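Because getType returns one of the general category constants, it fits naturally into a switch. The sketch below maps a handful of categories to their two-letter abbreviations from the field summary; `CategoryDemo` and `category` are hypothetical names, and the set of cases is deliberately incomplete.

```java
final class CategoryDemo {
    /** Two-letter general category for a few common categories, else "other". */
    static String category(int cp) {
        switch (Character.getType(cp)) {
            case Character.UPPERCASE_LETTER:      return "Lu";
            case Character.LOWERCASE_LETTER:      return "Ll";
            case Character.DECIMAL_DIGIT_NUMBER:  return "Nd";
            case Character.SPACE_SEPARATOR:       return "Zs";
            case Character.CURRENCY_SYMBOL:       return "Sc";
            case Character.CONNECTOR_PUNCTUATION: return "Pc";
            default:                              return "other";
        }
    }
}
```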
Returns: the value of the wrapped character
Parameters: ch character to test
Returns: true if ch is a Unicode character, else false
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode character, else false
Since: 1.5
For example, '0' is a digit. A character is a Unicode digit if getType() returns DECIMAL_DIGIT_NUMBER.
Parameters: ch character to test
Returns: true if ch is a Unicode decimal digit, else false
For example, '0' is a digit. A character is a Unicode digit if getType() returns DECIMAL_DIGIT_NUMBER.
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode decimal digit, else false
Since: 1.5
Parameters: ch the character
Returns: true if the character is a high surrogate character
Since: 1.5
This includes the non-whitespace ISO control characters ('\u0000' through '\u0008', '\u000E' through '\u001B', and '\u007F' through '\u009F'), and FORMAT characters.
Parameters: ch character to test
Returns: true if ch is ignorable in a Unicode or Java identifier
Since: 1.1
This includes the non-whitespace ISO control characters ('\u0000' through '\u0008', '\u000E' through '\u001B', and '\u007F' through '\u009F'), and FORMAT characters.
Parameters: codePoint character to test
Returns: true if codePoint is ignorable in a Unicode or Java identifier
Since: 1.5
Parameters: ch character to test
Returns: true if ch is an ISO Control character, else false
Since: 1.1
Parameters: codePoint the character to check
Returns: true if codePoint is in the range U+0000 through U+001F or U+007F through U+009F
Since: 1.5
Parameters: ch character to test
Returns: true if ch can follow the first letter in a Java identifier
Since: 1.1
Parameters: codePoint character to test
Returns: true if codePoint can follow the first letter in a Java identifier
Since: 1.5
Parameters: ch character to test
Returns: true if ch can start a Java identifier, else false
Since: 1.1
Parameters: codePoint character to test
Returns: true if codePoint can start a Java identifier, else false
Since: 1.5
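The identifier predicates are usually combined: one test for the first code point and another for the rest. The sketch below does this by code point so that supplementary characters are handled; `IdentifierCheck` and `isIdentifier` are hypothetical names, and the check deliberately ignores reserved words such as `class`.

```java
final class IdentifierCheck {
    /** True if s is lexically shaped like a Java identifier (keywords not excluded). */
    static boolean isIdentifier(String s) {
        if (s.isEmpty()) {
            return false;
        }
        int first = s.codePointAt(0);
        if (!Character.isJavaIdentifierStart(first)) {
            return false;
        }
        for (int i = Character.charCount(first); i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (!Character.isJavaIdentifierPart(cp)) {
                return false;
            }
            i += Character.charCount(cp);
        }
        return true;
    }
}
```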
Deprecated: Replaced by isJavaIdentifierStart(char).
Determines if a character can start a Java identifier. This is the combination of isLetter, any character where getType returns LETTER_NUMBER, currency symbols (like '$'), and connecting punctuation (like '_').
Parameters: ch character to test
Returns: true if ch can start a Java identifier, else false
Deprecated: Replaced by isJavaIdentifierPart(char).
Determines if a character can follow the first letter in a Java identifier. This is the combination of isJavaLetter (isLetter, type of LETTER_NUMBER, currency, connecting punctuation) and digit, numeric letter (like Roman numerals), combining marks, non-spacing marks, or isIdentifierIgnorable.
Parameters: ch character to test
Returns: true if ch can follow the first letter in a Java identifier
Parameters: ch character to test
Returns: true if ch is a Unicode letter, else false
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode letter, else false
Since: 1.5
Parameters: ch character to test
Returns: true if ch is a Unicode letter or a Unicode digit, else false
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode letter or a Unicode digit, else false
Since: 1.5
For example, 'a' is lowercase. Returns true if getType() returns LOWERCASE_LETTER.
Parameters: ch character to test
Returns: true if ch is a Unicode lowercase letter, else false
For example, 'a' is lowercase. Returns true if getType() returns LOWERCASE_LETTER.
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode lowercase letter, else false
Since: 1.5
Parameters: ch the character
Returns: true if the character is a low surrogate character
Since: 1.5
For example, '(' (U+0028 LEFT PARENTHESIS) appears as '(' in left-to-right text, but ')' in right-to-left text.
Parameters: ch the character to look up
Returns: true if the character is mirrored
Since: 1.4
For example, '(' (U+0028 LEFT PARENTHESIS) appears as '(' in left-to-right text, but ')' in right-to-left text.
Parameters: codePoint the character to look up
Returns: true if the character is mirrored
Since: 1.5
Deprecated: Replaced by isWhitespace(char).
Determines if a character is an ISO-LATIN-1 space. This is only the five characters '\t', '\n', '\f', '\r', and ' '.
Parameters: ch character to test
Returns: true if ch is a space, else false
Parameters: ch character to test
Returns: true if ch is a Unicode space, else false
Since: 1.1
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode space, else false
Since: 1.5
Parameters: codePoint a Unicode code point
Returns: true if the code point is in the supplementary range
Since: 1.5
Parameters: ch1 the first character ch2 the second character
Returns: true if the characters compose a surrogate pair
Since: 1.5
Parameters: ch character to test
Returns: true if ch is a Unicode titlecase letter, else false
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode titlecase letter, else false
Since: 1.5
Parameters: ch character to test
Returns: true if ch can follow the first letter in a Unicode identifier
Since: 1.1
Parameters: codePoint character to test
Returns: true if codePoint can follow the first letter in a Unicode identifier
Since: 1.5
Parameters: ch character to test
Returns: true if ch can start a Unicode identifier, else false
Since: 1.1
Parameters: codePoint character to test
Returns: true if codePoint can start a Unicode identifier, else false
Since: 1.5
For example, 'A' is uppercase. Returns true if getType() returns UPPERCASE_LETTER.
Parameters: ch character to test
Returns: true if ch is a Unicode uppercase letter, else false
For example, 'A' is uppercase. Returns true if getType() returns UPPERCASE_LETTER.
Parameters: codePoint character to test
Returns: true if codePoint is a Unicode uppercase letter, else false
Since: 1.5
Parameters: codePoint a Unicode code point
Returns: true if the code point is valid
Since: 1.5
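This includes Unicode space characters (SPACE_SEPARATOR, LINE_SEPARATOR, and PARAGRAPH_SEPARATOR) except the non-breaking spaces ('\u00A0', '\u2007', and '\u202F'); and these characters: '\u0009', '\u000A', '\u000B', '\u000C', '\u000D', '\u001C', '\u001D', '\u001E', and '\u001F'.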
Parameters: ch character to test
Returns: true if ch is Java whitespace, else false
Since: 1.1
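This includes Unicode space characters (SPACE_SEPARATOR, LINE_SEPARATOR, and PARAGRAPH_SEPARATOR) except the non-breaking spaces ('\u00A0', '\u2007', and '\u202F'); and these characters: '\u0009', '\u000A', '\u000B', '\u000C', '\u000D', '\u001C', '\u001D', '\u001E', and '\u001F'.
Parameters: codePoint character to test
Returns: true if codePoint is Java whitespace, else false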
Since: 1.5
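isWhitespace and isSpaceChar overlap but are not interchangeable, and the difference shows up on tabs and non-breaking spaces. The sketch below labels a code point by which of the two predicates accept it; `WhitespaceDemo` and `classify` are hypothetical names.

```java
final class WhitespaceDemo {
    /** Reports which of isWhitespace / isSpaceChar accept the code point. */
    static String classify(int cp) {
        boolean ws = Character.isWhitespace(cp);
        boolean sp = Character.isSpaceChar(cp);
        if (ws && sp) {
            return "both";
        }
        if (ws) {
            return "whitespace only";   // e.g. control characters such as tab
        }
        if (sp) {
            return "space char only";   // e.g. the non-breaking spaces
        }
        return "neither";
    }
}
```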
Returns the index into the given CharSequence that is offset codePointOffset code points from index.
Parameters: seq the CharSequence index the start position in the CharSequence codePointOffset the number of code points offset from the start position
Returns: the index into the CharSequence that is codePointOffset code points offset from index
Throws: NullPointerException if seq is null IndexOutOfBoundsException if index is negative or greater than the length of the sequence. IndexOutOfBoundsException if codePointOffset is positive and the subsequence from index to the end of seq has fewer than codePointOffset code points IndexOutOfBoundsException if codePointOffset is negative and the subsequence from the start of seq to index has fewer than (-codePointOffset) code points
Since: 1.5
Returns the index into the given char subarray that is offset codePointOffset code points from index.
Parameters: a the char array start the start index of the subarray count the length of the subarray index the index to be offset codePointOffset the number of code points offset from index
Returns: the index into the char array
Throws: NullPointerException if a is null IndexOutOfBoundsException if start or count is negative or if start + count is greater than the length of the array IndexOutOfBoundsException if index is less than start or larger than start + count IndexOutOfBoundsException if codePointOffset is positive and the subarray from index to start + count - 1 has fewer than codePointOffset code points. IndexOutOfBoundsException if codePointOffset is negative and the subarray from start to index - 1 has fewer than (-codePointOffset) code points
Since: 1.5
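offsetByCodePoints is what makes it safe to cut a string at a code point boundary instead of a char index. The following sketch keeps at most the first n code points of a string; `Truncate` and `firstCodePoints` are hypothetical names, and a negative n is simply left to fail with the IndexOutOfBoundsException described above.

```java
final class Truncate {
    /** Returns the prefix of s containing at most n code points. */
    static String firstCodePoints(String s, int n) {
        int total = Character.codePointCount(s, 0, s.length());
        if (n >= total) {
            return s; // asking for more than exists would throw, so short-circuit
        }
        // Translate "n code points from index 0" into a char index.
        int end = Character.offsetByCodePoints(s, 0, n);
        return s.substring(0, end);
    }
}
```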
Since: 1.5
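For reverseBytes, the effect is easiest to see next to an equivalent shift-and-mask expression. The sketch below is one way to write that equivalent by hand; `ReverseBytesDemo` and `swap` are hypothetical names, and the library method should be preferred in real code.

```java
final class ReverseBytesDemo {
    /** Hand-written byte swap of a 16-bit char, for comparison with reverseBytes. */
    static char swap(char val) {
        // Move the high byte down and the low byte up, then recombine.
        return (char) (((val >> 8) & 0x00FF) | ((val << 8) & 0xFF00));
    }
}
```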
Parameters: codePoint the Unicode code point
Returns: the UTF-16 representation of that code point
Throws: IllegalArgumentException if the code point is not a valid Unicode code point
Since: 1.5
Parameters: codePoint the unicode code point dst the target char array dstIndex the start index for the target
Returns: number of characters written to dst
Throws: IllegalArgumentException if codePoint is not a valid Unicode code point NullPointerException if dst is null IndexOutOfBoundsException if dstIndex is not valid in dst or if the UTF-16 representation does not fit into dst
Since: 1.5
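The three-argument toChars returns the number of chars it wrote, which lets a caller fill one buffer with several code points. Here is a minimal sketch of that pattern; `ToCharsDemo` and `encode` are hypothetical names.

```java
final class ToCharsDemo {
    /** Encodes the given code points as a UTF-16 string. */
    static String encode(int... cps) {
        char[] buf = new char[cps.length * 2]; // worst case: every code point needs 2 chars
        int n = 0;
        for (int cp : cps) {
            // Writes 1 or 2 chars at position n and reports how many were written.
            n += Character.toChars(cp, buf, n);
        }
        return new String(buf, 0, n);
    }
}
```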
Parameters: high the high character of the pair low the low character of the pair
Returns: the corresponding code point
Since: 1.5
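The mapping from a surrogate pair to a code point is plain arithmetic on the constants defined in this class. The sketch below spells that arithmetic out and validates its input first; `SurrogateMath` and `combine` are hypothetical names, and the up-front isSurrogatePair check is an addition made for this example.

```java
final class SurrogateMath {
    /** Combines a high and low surrogate into a supplementary code point. */
    static int combine(char high, char low) {
        if (!Character.isSurrogatePair(high, low)) {
            throw new IllegalArgumentException("not a surrogate pair");
        }
        // Each surrogate carries 10 bits; the result is offset by 0x10000.
        return ((high - Character.MIN_HIGH_SURROGATE) << 10)
                + (low - Character.MIN_LOW_SURROGATE)
                + Character.MIN_SUPPLEMENTARY_CODE_POINT;
    }
}
```

For the pair 0xD83D, 0xDE00 this gives (0x3D << 10) + 0x200 + 0x10000 = 0x1F600, the same value toCodePoint returns.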
Parameters: ch character to convert to lowercase
Returns: lowercase mapping of ch, or ch if lowercase mapping does not exist
Parameters: codePoint character to convert to lowercase
Returns: lowercase mapping of codePoint, or codePoint if lowercase mapping does not exist
Since: 1.5
Returns: a String containing one character -- the wrapped character of this instance
Parameters: ch the character to convert
Returns: a String containing the character
Since: 1.4
Parameters: ch character to convert to titlecase
Returns: titlecase mapping of ch, or ch if titlecase mapping does not exist
Parameters: codePoint character to convert to titlecase
Returns: titlecase mapping of codePoint, or codePoint if titlecase mapping does not exist
Since: 1.5
Parameters: ch character to convert to uppercase
Returns: uppercase mapping of ch, or ch if uppercase mapping does not exist
Parameters: codePoint character to convert to uppercase
Returns: uppercase mapping of codePoint, or codePoint if uppercase mapping does not exist
Since: 1.5
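Titlecase and uppercase mappings differ for a few characters, which matters when capitalizing only the first letter of a word. The sketch below capitalizes with toTitleCase on the first code point; `Capitalize` and `capitalize` are hypothetical names, and the approach is a per-character simplification that ignores locale-specific rules.

```java
final class Capitalize {
    /** Titlecases the first code point of s and leaves the rest unchanged. */
    static String capitalize(String s) {
        if (s.isEmpty()) {
            return s;
        }
        int first = s.codePointAt(0);
        int titled = Character.toTitleCase(first); // returns first if no mapping exists
        return new StringBuilder(s.length())
                .appendCodePoint(titled)
                .append(s, Character.charCount(first), s.length())
                .toString();
    }
}
```

The digraph U+01C6 is one case where the two mappings disagree: its titlecase form is U+01C5 and its uppercase form is U+01C4.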
Returns a Character object wrapping the value. In contrast to the Character constructor, this method will cache some values. It is used by boxing conversion.
Parameters: val the value to wrap
Returns: the Character
Since: 1.5
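The caching behaviour of valueOf can be observed by comparing references. The text above only says that some values are cached; the sketch below assumes the range guaranteed by the Java SE documentation, '\u0000' through '\u007F', and makes no claim about values outside it. `ValueOfDemo` and `sameInstance` are hypothetical names.

```java
final class ValueOfDemo {
    /** True if two valueOf calls for c return the very same object. */
    static boolean sameInstance(char c) {
        Character first = Character.valueOf(c);
        Character second = Character.valueOf(c);
        return first == second; // reference comparison on purpose
    }
}
```

Code that compares Character objects should still use equals or compareTo, since identity is only dependable inside the cached range.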