|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectcom.waveset.util.Encoding
Utilities related to character encodings.
Field Summary | |
static java.lang.String |
ASCII
|
static java.lang.String |
code_id
|
static java.lang.String |
LATIN1
|
static java.lang.String |
LATIN2
|
static java.lang.String |
UCS2
|
static java.lang.String |
UCS4
|
static java.lang.String |
UTF16
|
static java.lang.String |
UTF8
|
Constructor Summary | |
Encoding()
|
Method Summary | |
static java.lang.String |
decode(java.lang.String psz,
java.lang.String encoding)
Convert a string in the specified encoding to the default encoding. |
static java.lang.String |
decodePseudoUTF8(java.lang.String pseudoUTF8)
|
static java.lang.String |
encode(java.lang.String psz,
java.lang.String encoding)
Convert an string in the default encoding to the specified encoding. |
static java.lang.String |
encodePseudoUTF8(java.lang.String psz)
|
static java.lang.String |
getDefaultEncoding()
|
static boolean |
isValidASCII(byte[] bytes)
|
static boolean |
isValidASCII(java.lang.String s)
|
static boolean |
isValidUTF8(byte[] bytes,
boolean beStrict)
|
static int |
lengthInUTF8(java.lang.String psz)
|
static void |
main(java.lang.String[] args)
|
static java.lang.String |
toUnicodeEscapedAscii(java.lang.String str)
If you are concerned with how fast this is, it goes at about 6MB/sec for ASCII, and almost 1MB/sec for non-ASCII. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
public static final java.lang.String code_id
public static final java.lang.String UTF8
public static final java.lang.String UTF16
public static final java.lang.String UCS2
public static final java.lang.String UCS4
public static final java.lang.String LATIN1
public static final java.lang.String LATIN2
public static final java.lang.String ASCII
Constructor Detail |
public Encoding()
Method Detail |
public static int lengthInUTF8(java.lang.String psz) throws InternalError
InternalError
public static java.lang.String getDefaultEncoding()
public static java.lang.String decodePseudoUTF8(java.lang.String pseudoUTF8)
This method assumes that the lower-order eight bits of each character in the input string represent a properly encoded UTF-8 byte.
public static java.lang.String encodePseudoUTF8(java.lang.String psz)
public static java.lang.String encode(java.lang.String psz, java.lang.String encoding) throws InvalidArgument
psz
- a String in the default encoding.encoding
- the name of a character encoding scheme.
psz
converted to the specified encoding
.
For example, a call to encode("pe�a", Encoding.UTF8);
would return "peña"
.
Inverse of {@link #decode).
InvalidArgument
getDefaultEncoding()
public static java.lang.String decode(java.lang.String psz, java.lang.String encoding) throws InvalidArgument
psz
- a String containing characters
constructed in the default encoding scheme
from bytes in the specified encoding.encoding
- the name of a character encoding scheme.
psz
converted back to the default encoding.
For example, a call to decode("peña", Encoding.UTF8);
would return "pe�a"
.
InvalidArgument
getDefaultEncoding()
public static boolean isValidASCII(byte[] bytes)
public static boolean isValidASCII(java.lang.String s)
true
if the specified string
contains only characters in the ASCII range;
otherwise return false
.public static boolean isValidUTF8(byte[] bytes, boolean beStrict)
bytes
- byte array to checkbeStrict
- if true, specifies rigorous validation.
Specifically, disallows "pseudo-UTF8".
true
if the given sequence of bytes is valid UTF-8;
otherwise false
.
public static java.lang.String toUnicodeEscapedAscii(java.lang.String str)
public static void main(java.lang.String[] args) throws java.lang.Exception
java.lang.Exception
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |