Consider this json file named h.json
I want to convert this into a python dataclass.
{"acc1":{"email":"[email protected]","password":"acc1","name":"ACC1","salary":1},"acc2":{"email":"[email protected]","password":"acc2","name":"ACC2","salary":2}}
I could use an alternative constructor for getting each account, for example:
import json
from dataclasses import dataclass@dataclass
class Account(object):email:strpassword:strname:strsalary:int@classmethoddef from_json(cls, json_key):file = json.load(open("h.json"))return cls(**file[json_key])
but this is limited to what arguments (email, name, etc.) were defined in the dataclass.
What if I were to modify the json to include another thing, say age?
The script would end up returning a TypeError
, specifically TypeError: __init__() got an unexpected keyword argument 'age'
.
Is there a way to dynamically adjust the class attributes based on the keys of the dict (json object), so that I don't have to add attributes each time I add a new key to the json?
Since it sounds like your data might be expected to be dynamic and you want the freedom to add more fields in the JSON object without reflecting the same changes in the model, I'd also suggest to check out typing.TypedDict
instead a dataclass
.
Here's an example with TypedDict
, which should work in Python 3.7+. Since TypedDict was introduced in 3.8, I've instead imported it from typing_extensions
so it's compatible with 3.7 code.
from __future__ import annotationsimport json
from io import StringIO
from typing_extensions import TypedDictclass Account(TypedDict):email: strpassword: strname: strsalary: intjson_data = StringIO("""{"acc1":{"email":"[email protected]","password":"acc1","name":"ACC1","salary":1},"acc2":{"email":"[email protected]","password":"acc2","name":"ACC2","salary":2,"someRandomKey": "string"}
}
""")data = json.load(json_data)
name_to_account: dict[str, Account] = dataacct = name_to_account['acc2']# Your IDE should be able to offer auto-complete suggestions within the
# brackets, when you start typing or press 'Ctrl + Space' for example.
print(acct['someRandomKey'])
If you are set on using dataclasses to model your data, I'd suggest checking out a JSON serialization library like the dataclass-wizard (disclaimer: I am the creator) which should handle extraneous fields in the JSON data as mentioned, as well as a nested dataclass model if you find your data becoming more complex.
It also has a handy tool that you can use to generate a dataclass schema from JSON data, which can be useful for instance if you want to update your model class whenever you add new fields in the JSON file as mentioned.