etiq.dataprofile package
Submodules
etiq.dataprofile.dataprofile module
- class etiq.dataprofile.dataprofile.DataProfile(name: str, type: ~etiq.dataprofile.dataprofile.DatasetType = DatasetType.Base, hash: str = '', feature_profiles: ~typing.List[~etiq.dataprofile.dataprofile.FeatureDataProfile] = <factory>)
Bases:
object
A simple data class to represent a data profile
- add_feature_profile(afeature_profile: FeatureDataProfile)
- feature_profiles: List[FeatureDataProfile]
- hash: str = ''
- name: str
- type: DatasetType = 1
- class etiq.dataprofile.dataprofile.DatasetType(value)
Bases:
Enum
An enumeration.
- Base = 1
- Comparison = 2
- class etiq.dataprofile.dataprofile.FeatureDataProfile(feature: str, type: FeatureType = FeatureType.Continuous, distinct_values: int = 0, minimum: float = nan, maximum: float = nan, missing_values: int = 0, missing_values_percentage: float = 0.0, condition: str | None = None)
Bases:
object
A simple data class to represent a data profile for a feature
- condition: str | None = None
- distinct_values: int = 0
- feature: str
- maximum: float = nan
- minimum: float = nan
- missing_values: int = 0
- missing_values_percentage: float = 0.0
- type: FeatureType = 2
Module contents
- class etiq.dataprofile.DataProfile(name: str, type: ~etiq.dataprofile.dataprofile.DatasetType = DatasetType.Base, hash: str = '', feature_profiles: ~typing.List[~etiq.dataprofile.dataprofile.FeatureDataProfile] = <factory>)
Bases:
object
A simple data class to represent a data profile
- add_feature_profile(afeature_profile: FeatureDataProfile)
- feature_profiles: List[FeatureDataProfile]
- hash: str = ''
- name: str
- type: DatasetType = 1
- class etiq.dataprofile.FeatureDataProfile(feature: str, type: FeatureType = FeatureType.Continuous, distinct_values: int = 0, minimum: float = nan, maximum: float = nan, missing_values: int = 0, missing_values_percentage: float = 0.0, condition: str | None = None)
Bases:
object
A simple data class to represent a data profile for a feature
- condition: str | None = None
- distinct_values: int = 0
- feature: str
- maximum: float = nan
- minimum: float = nan
- missing_values: int = 0
- missing_values_percentage: float = 0.0
- type: FeatureType = 2