Possessing a big amount of features and proportionally few data points marked the first challenge that could be solved with Dimension Reduction. Compared with other data analysis techniques, only the Multiple Correspondence Analysis (MCA) could reduce the provided 400 features to a total of 14 without any loss of information. That is why MCA served as basis for the developed Machine Learning Model.
The second challenge was the selection of the most appropriate regression algorithm as the different applicable algorithms are similar in principle. The training dataset was split and Cross Validation was applied. Fine-tuning the parameters of the chosen algorithm allowed for a prediction of the original test dataset.
Lacking of data confounding external factors could not be considered in this Machine Learning Model.