# Short description for the data columns

## Training data

* formula: chemical formula 	
* temp: temperature / 100K (to avoid large numbers for T^x)
* Conduct. (S/cm): electrical conductivity
* Seebeck (uV/K): Seebeck coefficient
* kappa (W/m K): thermal conductivity	
* ZT: thermoelectric figure of merit	
* pf, uW/(cm K^2): thermoelectric power factor
* temp^2: (temperature / 100K)^2
* temp^3: (temperature / 100K)^3	
* temp^4: (temperature / 100K)^4	
* cluster_number: the cluster a given material belongs to (1, 2, 3 or 4, corresponding to orange, green, red or violet, respectively).
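
The scaled temperature columns and the two derived thermoelectric quantities can be sketched as follows. This is only an illustration based on the textbook definitions (power factor = sigma * S^2, ZT = sigma * S^2 * T / kappa) and the units given in the column names; whether the values in the dataset were computed this way or taken directly from measurements is not stated here. The function names are hypothetical.

```python
def temperature_features(temperature_k, max_power=4):
    """Return temp, temp^2, ..., temp^max_power with temp = T / 100 K."""
    temp = temperature_k / 100.0  # scaling keeps the higher powers small
    return {("temp" if n == 1 else f"temp^{n}"): temp ** n
            for n in range(1, max_power + 1)}


def power_factor(conductivity_s_per_cm, seebeck_uv_per_k):
    """Power factor sigma * S^2 in uW/(cm K^2)."""
    # (S/cm) * (uV/K)^2 = 1e-12 W/(cm K^2) = 1e-6 uW/(cm K^2)
    return conductivity_s_per_cm * seebeck_uv_per_k ** 2 * 1e-6


def figure_of_merit(conductivity_s_per_cm, seebeck_uv_per_k,
                    kappa_w_per_m_k, temperature_k):
    """Dimensionless ZT = sigma * S^2 * T / kappa, evaluated in SI units."""
    sigma_si = conductivity_s_per_cm * 100.0  # S/cm -> S/m
    seebeck_si = seebeck_uv_per_k * 1e-6      # uV/K -> V/K
    return sigma_si * seebeck_si ** 2 * temperature_k / kappa_w_per_m_k


if __name__ == "__main__":
    # Made-up input values, used only to show the unit handling.
    print(temperature_features(300.0))
    print(power_factor(1000.0, 200.0))
    print(figure_of_merit(1000.0, 200.0, 1.5, 300.0))
```

Note that `figure_of_merit` expects the absolute temperature in K, i.e. the `temp` column multiplied by 100.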

max: Largest value among the constituent elements. For example, for AB2 this is the larger of the numbers of valence electrons of element A and element B. min: Smallest value... avg: Average value... std: Deformed standard deviation...

* avg_Atomic number: Number of protons	
* avg_L quantum number: Azimuthal quantum number
* avg_Pauling EN: Pauling electronegativity
* avg_Gordy EN: Gordy electronegativity	
* avg_Metallic valence: Metallic valence
* avg_Number of valence electrons: Number of valence electrons
* avg_Number of s electrons: Number of s electrons
* avg_Specific heat (J/g•K): Specific heat (J/g•K)
* avg_Thermal conductivity (W/m•K): Thermal conductivity (W/m•K)
* std_Atomic number: this and all remaining columns follow the definitions of max, min, avg and std given above
* std_L quantum number	
* std_Pauling EN	
* std_Gordy EN	
* std_Metallic valence	
* std_Number of valence electrons	
* std_Number of s electrons	
* std_Specific heat (J/g•K)	
* std_Thermal conductivity (W/m•K)	
* max_Atomic number	
* max_L quantum number	
* max_Pauling EN	
* max_Gordy EN	
* max_Metallic valence	
* max_Number of valence electrons	
* max_Number of s electrons	
* max_Specific heat (J/g•K)	
* max_Thermal conductivity (W/m•K)	
* min_Atomic number	
* min_L quantum number	
* min_Pauling EN	
* min_Gordy EN	
* min_Metallic valence	
* min_Number of valence electrons	
* min_Number of s electrons	
* min_Specific heat (J/g•K)	
* min_Thermal conductivity (W/m•K)
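
As a rough illustration of how the max, min and avg columns relate to the chemical formula, the sketch below derives them for the atomic number of Bi2Te3. Several things here are assumptions: the helper names are hypothetical, the formula parser only handles simple formulas without brackets, and the average is weighted by composition, which may or may not match the convention used for the dataset. The std columns are left out because the "deformed standard deviation" is not fully defined above.

```python
import re

# Atomic numbers for the elements used in this example only.
ATOMIC_NUMBER = {"Bi": 83, "Te": 52}


def parse_formula(formula):
    """Parse a simple formula such as 'Bi2Te3' into {element: amount}."""
    amounts = {}
    for symbol, count in re.findall(r"([A-Z][a-z]?)(\d*(?:\.\d+)?)", formula):
        amounts[symbol] = amounts.get(symbol, 0.0) + (float(count) if count else 1.0)
    return amounts


def element_features(formula, property_table, name):
    """Return max_, min_ and avg_ features of one elemental property."""
    amounts = parse_formula(formula)
    values = [property_table[element] for element in amounts]
    total = sum(amounts.values())
    return {
        f"max_{name}": max(values),
        f"min_{name}": min(values),
        # Assumption: the average is weighted by the amount of each element.
        f"avg_{name}": sum(property_table[el] * n for el, n in amounts.items()) / total,
    }


if __name__ == "__main__":
    print(element_features("Bi2Te3", ATOMIC_NUMBER, "Atomic number"))
```

For Bi2Te3 this gives a maximum of 83, a minimum of 52 and a composition-weighted average of 64.4; an unweighted average over the two elements would be 67.5 instead.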

## Predicted data

* material_id: Material id as described by materialsproject.org
* formula: Chemical formula
* icsd_ids: Material id as described by icsd.fiz-karlsruhe.de
* distance_mean: mean value of distances from a given material to all the data points (materials) in the training dataset	
* nearest_formula: Closest (i.e. most similar) material in the training dataset	
* distance_min: distance to nearest_formula
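
The three distance columns can be pictured with the following minimal sketch. The distance metric and the feature space are not specified above, so the Euclidean distance and the two-component feature vectors used here are assumptions for illustration only; `distance_summary` is a hypothetical helper.

```python
import math


def distance_summary(candidate, training):
    """Summarise distances from one candidate to all training materials.

    candidate: feature vector of the material to be predicted
    training:  dict mapping training formula -> feature vector
    """
    # Assumption: Euclidean distance in feature space.
    distances = {formula: math.dist(candidate, vector)
                 for formula, vector in training.items()}
    nearest = min(distances, key=distances.get)
    return {
        "distance_mean": sum(distances.values()) / len(distances),
        "nearest_formula": nearest,
        "distance_min": distances[nearest],
    }


if __name__ == "__main__":
    # Made-up feature vectors, not taken from the dataset.
    training = {"Bi2Te3": (0.0, 0.0), "PbTe": (3.0, 4.0)}
    print(distance_summary((3.0, 0.0), training))
```

A small distance_min means that the training data contain a material very similar to the candidate, while distance_mean indicates how far the candidate lies from the training data as a whole.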

The names of the following columns are built from these parts:

* see: Seebeck coefficient
* el: electrical conductivity
* therm: thermal conductivity
* zt: thermoelectric figure of merit
* integer number: temperature in K
* fixed/nonfixed: fixed or nonfixed temperature 

Each value is either 1 (above a threshold value) or 0 (below it).

* see_300_fixed
* see_400_fixed
* see_700_fixed
* el_300_fixed
* el_400_fixed
* el_700_fixed
* therm_300_fixed
* therm_400_fixed
* therm_700_fixed
* see_300_nonfixed
* el_300_nonfixed
* therm_300_nonfixed
* zt_300_nonfixed
* see_400_nonfixed 
* el_400_nonfixed
* therm_400_nonfixed
* zt_400_nonfixed
* see_700_nonfixed
* el_700_nonfixed
* therm_700_nonfixed
* zt_700_nonfixed
* sum: The sum of the previous 21 columns, i.e. an integer between 0 and 21
* price, usd/mol: approximate cost in US$ per mol
* space_group: The space group as downloaded via 'The Materials API' using the material id
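
The 21 binary column names and the sum column can be reproduced with a short sketch like the one below. It follows the naming scheme described above (property, temperature in K, fixed/nonfixed); the constant and function names are hypothetical, and a row is represented as a plain dict for simplicity.

```python
PROPERTIES_FIXED = ("see", "el", "therm")
PROPERTIES_NONFIXED = ("see", "el", "therm", "zt")
TEMPERATURES_K = (300, 400, 700)

# Fixed columns are grouped by property, nonfixed columns by temperature,
# matching the order in which they are listed above.
FIXED_COLUMNS = [f"{prop}_{t}_fixed"
                 for prop in PROPERTIES_FIXED for t in TEMPERATURES_K]
NONFIXED_COLUMNS = [f"{prop}_{t}_nonfixed"
                    for t in TEMPERATURES_K for prop in PROPERTIES_NONFIXED]
FLAG_COLUMNS = FIXED_COLUMNS + NONFIXED_COLUMNS  # 9 + 12 = 21 columns


def flag_sum(row):
    """Count how many of the 21 binary columns are 1 for one material."""
    return sum(int(row[column]) for column in FLAG_COLUMNS)


if __name__ == "__main__":
    # Made-up row in which only the three zt columns are above threshold.
    row = {column: 0 for column in FLAG_COLUMNS}
    for t in TEMPERATURES_K:
        row[f"zt_{t}_nonfixed"] = 1
    print(len(FLAG_COLUMNS), flag_sum(row))
```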
