目录
介绍
python3 字符串默认编码为unicode,二进制数据使用byte类型表示,utf-8可以看做是unicode的一个扩展集
字符串通过 编码 转换成字节码,字节码通过 解码 成为字符串
encode:str –> bytes
decode:bytes – > str
代码如下:
import sys
print("print1:",sys.getdefaultencoding())
name ="中国"
name = name.encode("utf-8")
print("print2:",type(name))
name = name.decode("utf-8")
name = name.encode("gbk")
print("print4:",type(name))
输出结果:
print1: utf-8
print2: <class 'bytes'>
print4: <class 'bytes'>
字符串 转 字节(encode)
s1 = "中国"
//法1:
s1 = s1.encode('utf-8')
//方2:,缺省也是可以滴
s1 = s1.encode()
//方3:
s1 = s1.encode(encoding='utf-8')
//法4:我对此语句的理解是,现在用bytes(s1,encoding='utf-8')这个方法转换成字符节。原先是按照utf-8进行编码,
s1 = bytes(s1, encoding='utf-8')
字节 转 字符串(decode)
s1 = "中国"
s1 = s1.encode("utf-8")
//法1:现在的s1是字节类型
str = s1.decode('utf-8')
//法2:其中1是二进制类型,我对此语句的理解是,现在用str(bytes,encoding='utf-8')这个方法转换成字符串。按照utf-8进行编码,
str1 = str(s1, encoding='utf-8')