Batch yield and purity | The two columns in the data set are: the percentage yield from a batch reactor, and the purity of the feedstock.
The feedstock is what we add to the reactor, and the yield is measured after the reaction is completed.
The cause-and-effect direction is that the purity of the feedstock has (potential) impact on the yield. | 241 | 2 | multivariateregressionleast-squares |
Bioreactor yields | The percentage yield from a bioreactor given the temperature, impeller speed, duration, and whether or not the reactor has baffles. | 14 | 5 | multivariatecategoricalregression |
Cheddar cheese | Concentrations of acetic acid, H2S, and lactic acid in 30 samples of mature cheddar cheese. A subjective taste value is also provided. | 30 | 4 | multivariateregression |
Class grades | Grades from a Chemical Engineering course at McMaster University. | 99 | 6 | multivariatemissing-dataregression |
Distillation tower | Snapshot measurements on 27 variables from a distillation column; measured over 2.5 years. | 253 | 27 | multivariateoutliersregression |
Oil company DOE | Experimental data; testing amount of 4 materials added (A, B, C, D) in order to achieve a certain volumetric heat capacity, y. | 19 | 5 | multivariatecategoricaldoeregression |
Raw material outcome | Six characterizing measurements for batches of plastic pellets; the outcome when using this material, either Poor or Adequate, is also provided. | 24 | 7 | multivariatecategoricalregression |
Unlimited time test | The grades from a midterm exam, as well as the time taken by the student to write the exam. It was an "infinite" time midterm, so there was no time pressure to finish within the allocated period. | 80 | 2 | univariateregressionleast-squares |
Unlimited time test 2 | The grades from a midterm exam, as well as the time taken by the student to write the exam. It was an "infinite" time midterm, so there was no time pressure to finish within the allocated period. The test results were from 2013. | 61 | 2 | univariateregressionleast-squares |
Unlimited time test 3 | The grades from a midterm exam, as well as the time taken by the student to write the exam. It was an "infinite" time midterm, so there was no time pressure to finish within the allocated period. The test results were from 2013. | 89 | 2 | univariateregressionleast-squares |
Wine DOE | Data from a fractional factorial for profiling a new wine. The last 5 columns are the taste values from a panel of judges. Higher values are a better overall taste. | 16 | 13 | multivariatedoeregression |