Character Set Table CS3: (UCS-2 <-> UTF-8)
A-36 AT Commands Reference Manual June 30, 2008
Character Set Table CS3: (UCS-2 <-> UTF-8)

The following table shows the conversion between the UCS-2 and UTF-8 character sets.

Note: Conversion from the default GSM alphabet to the above character set is straightforward.

Conversions of the characters listed below the table are not supplied.

Character Set Table CS6: (UCS-2 Full table)

Character Set Table CS6: (UCS-2) is provided on CD due to its size.

Character Set Table CS7: (ASCII table)

The following table shows the conversion for the ASCII character set.

UCS2 UTF-8
Byte Encoding Bit Encoding Byte 1 Byte 2 Byte 3
0000 - 007F 000000000xxxxxx
x0xxxxxxx
0080 - 07FF 00000yyyyyxxxxx
x110yyyyy 10xxxxxx
0800 - FFFF zzzzyyyyyyxxxxxx 1110zzzz 10yyyyyy 10xxxxxx
Decimal Octal Hex Binary Val ue Description
000 000 000 00000000 NUL (Null char.)
001 001 001 00000001 SOH (Start of
Header)
002 002 002 00000010 STX (Start of Text)
003 003 003 00000011 ETX (End of Text)
004 004 004 00000100 EOT (End of
Transmission)
005 005 005 00000101 ENQ (Enquiry)
006 006 006 00000110 ACK (Acknowledgm
ent)
007 007 007 00000111 BEL (Bell)
008 010 008 00001000 BS (Backspace)
009 011 009 00001001 HT (Horizontal
Tab)
010 012 00A 00001010 LF (Line Feed)