
What's the difference between encoding and charset?

Date: 2021-04-10 00:29:23

I am confused about text encoding and charset. For many reasons, I have to learn non-Unicode, non-UTF-8 stuff in my upcoming work.

I find the word "charset" in email headers, as in "ISO-2022-JP", but there's no such encoding in text editors. (I looked through several different text editors.)

What's the difference between text encoding and charset? I'd appreciate it if you could show me some use case examples.

 
Answer:

Basically:

  1. A charset is the set of characters you can use.
  2. An encoding is the way these characters are stored in memory, that is, as bytes.
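
To make the first point concrete, here is a minimal Python sketch. The helper name can_encode is made up for this illustration; it relies only on the standard behavior that str.encode raises UnicodeEncodeError when a character is not part of the target codec's charset.

```python
def can_encode(text: str, codec: str) -> bool:
    """Return True if every character of `text` exists in the codec's charset."""
    try:
        text.encode(codec)
        return True
    except UnicodeEncodeError:
        return False


word = "caf\u00e9"  # "café"; U+00E9 is the precomposed "é"

print(can_encode("cafe", "ascii"))   # True: plain ASCII letters
print(can_encode(word, "ascii"))     # False: "é" is not in the ASCII charset
print(can_encode(word, "latin-1"))   # True: "é" is in ISO-8859-1
print(word.encode("latin-1"))        # b'caf\xe9', the stored bytes
```

The last line shows the second point: once a character is in the charset, the encoding decides which bytes represent it.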

Answer 2:

Every encoding has a particular charset associated with it, but there can be more than one encoding for a given charset. A charset is simply what it sounds like: a set of characters. There are a large number of charsets, including many that are intended for particular scripts or languages.
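
As a rough illustration of "more than one encoding for a given charset", the sketch below encodes the same Japanese text with three of Python's built-in codecs. It assumes, as a simplification, that ISO-2022-JP, Shift_JIS, and EUC-JP all draw on largely the same Japanese character set (JIS X 0208); the codec names are Python's, not something from the question.

```python
text = "\u65e5\u672c"  # "日本" (Japan)

# Three encodings that cover (roughly) the same Japanese character set.
codecs_to_try = ("iso2022_jp", "shift_jis", "euc_jp")
encoded = {name: text.encode(name) for name in codecs_to_try}

for name, data in encoded.items():
    # Same characters, different byte sequences.
    print(f"{name:12} {data.hex()}")
    # Decoding with the matching codec recovers the original text.
    assert data.decode(name) == text
```

This is also why "ISO-2022-JP" shows up as a charset value in email headers: in that context the label names the encoding used for the message bytes.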

We are, however, well along in the transition to Unicode, which includes a character set capable of representing almost all the world's scripts. There are multiple encodings for Unicode. An encoding is a way of mapping a string of characters to a string of bytes. Examples of Unicode encodings include UTF-8, UTF-16 BE, and UTF-16 LE. Each of these has advantages for particular applications or machine architectures.
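
A small Python sketch of the same idea: one Unicode character, three different byte strings depending on the encoding. The variable names are chosen for this example only.

```python
text = "\u00e9"  # "é", a single Unicode character (code point U+00E9)

utf8 = text.encode("utf-8")
utf16be = text.encode("utf-16-be")
utf16le = text.encode("utf-16-le")

print(utf8.hex())     # c3a9  (two bytes in UTF-8)
print(utf16be.hex())  # 00e9  (big-endian: high byte first)
print(utf16le.hex())  # e900  (little-endian: low byte first)

# All three decode back to the same character.
assert utf8.decode("utf-8") == utf16be.decode("utf-16-be") == utf16le.decode("utf-16-le")
```

The charset (Unicode) is the same in all three cases; only the mapping from characters to bytes changes.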

 

 

 

 

 


Source: https://www.cnblogs.com/chucklu/p/14638222.html
