The trainer will store the data on disk (for now) in a structured folder that will hold training models that will be used to generate the synthetic data.
The trainer will store the data on disk (for now) in a structured folder that will hold training models that will be used to generate the synthetic data.
2. Generate a candidate dataset from the learnt features
**Generate a candidate dataset from the learned features**
import pandas as pd
import pandas as pd
@ -46,11 +47,14 @@ GANS will generate data assuming the original data has all the value space neede
- No new data will be created
- No new data will be created
Assuming we have a dataset with an gender attribute with values [M,F].
Assuming we have a dataset with an gender attribute with values [M,F].
The synthetic data will not be able to generate genders outside [M,F]
The synthetic data will not be able to generate genders outside [M,F]
- Not advised on continuous values
- Not advised on continuous values
GANS work well on discrete values and thus are not advised to be used.
GANS work well on discrete values and thus are not advised to be used.