python 使用 json.loads 解码后编码变为 unicode 解决办法

写了一个小东西,更新入库,然后重新根据长度判断是否更新,这里遇到了坑,长度不一样,最后确定在json loads后编码问题.

编码不对会造成很多问题,比如字符串长度错误,无法匹配等等.

错误示例:

>>> import json

>>> original_list = [‘a’, ‘b’]

>>> json_list = json.dumps(original_list)

>>> json_list

‘["a", "b"]’

>>> new_list = json.loads(json_list)

>>> new_list

[u’a’, u’b’] # I want these to be of type `str`, not `unicode`

解决办法,使用object_hook:

import json

def json_load_byteified(file_handle):

return _byteify(

json.load(file_handle, object_hook=_byteify),

ignore_dicts=True

)

def json_loads_byteified(json_text):

return _byteify(

json.loads(json_text, object_hook=_byteify),

ignore_dicts=True

)

def _byteify(data, ignore_dicts = False):

# if this is a unicode string, return its string representation

if isinstance(data, unicode):

return data.encode(‘utf-8’)

# if this is a list of values, return list of byteified values

if isinstance(data, list):

return [ _byteify(item, ignore_dicts=True) for item in data ]

# if this is a dictionary, return dictionary of byteified keys and values

# but only if we haven’t already byteified it

if isinstance(data, dict) and not ignore_dicts:

return {

_byteify(key, ignore_dicts=True): _byteify(value, ignore_dicts=True)

for key, value in data.iteritems()

}

# if it’s anything else, return it in its original form

return data

将 json.loads 修改为 json_loads_byteified

使用示例:

>>> json_loads_byteified(‘{"Hello": "World"}’)

{‘Hello’: ‘World’}

>>> json_loads_byteified(‘"I am a top-level string"’)

‘I am a top-level string’

>>> json_loads_byteified(‘7’)

>>> json_loads_byteified(‘["I am inside a list"]’)

[‘I am inside a list’]

>>> json_loads_byteified(‘[[[[[[[["I am inside a big nest of lists"]]]]]]]]’)

[[[[[[[[‘I am inside a big nest of lists’]]]]]]]]

>>> json_loads_byteified(‘{"foo": "bar", "things": [7, {"qux": "baz", "moo": {"cow": ["milk"]}}]}’)

{‘things’: [7, {‘qux’: ‘baz’, ‘moo’: {‘cow’: [‘milk’]}}], ‘foo’: ‘bar’}

>>> json_load_byteified(open(‘somefile.json’))

{‘more json’: ‘from a file’}

转载自:https://stackoverflow.com/questions/956867/how-to-get-string-objects-instead-of-unicode-from-json

python 使用 json.loads 解码后编码变为 unicode 解决办法

About 欧阳逍遥

发表回复

Related Posts

python dict list 字典数组 根据某个key的值排序

python 终止运行中死循环的线程 kill Thread 完美解决办法

python selenium PhantomJS 动态设置代理IP

About 欧阳逍遥

发表回复

python dict list 字典数组根据某个key的值排序