l  western alphabet (single byte character set)

l  Asian character set (double byte character set).


1A2B, 3C, 4D, 5E, 6F, 7G, 8H, 9I, 10J, 11K, 12L, 13M, 14N, 15O, 16P, 17Q, 18R, 19S, 20T, 21U, 22V, 23W, 24X, 25Y, 26Z


Representing Text (single byte)

  • A convention is needed for how to represent text
  • Simplest solution: Use one byte per character
  • o   Allows to represent 256 different characters (including control characters)
  • o   Example: <4B> is 'K', <65> is 'e', <69> is 'i', <6F> is 'o', thus <4B><65><69><6F> stands for 'Keio' (A)
  • o   Another example: <D2> is 'K', <85> is 'e', <89> is 'i', <96> is 'o', thus <D2><85><89><96>stands for 'Keio' (E)
  • Standard for character encoding is indispensable to make storage and communication work
  • The above examples use ASCII (A, most computers) and EBCDIC (E; IBM mainframes)
  • For 'basic English', 7 bits would be enough
  • 8 bits are enough for various European regions/languages

Quote from: https://www.w3.org/People/D%C3%BCrst/SFC/2004/0418Hagino.html

A double-byte character set (DBCS), "expanded 8-bit character set"

an extended single-byte character set (SBCS)

Q: What does it mean that “one-byte language”?

You can use one byte (=8 bit or 256 different values) to represent basically each characters commonly used in English.





Dare to be different.愛你所愛

外語專科出版社 發表在 痞客邦 PIXNET 留言(0) 人氣()