Machine Learning

In this part we will assess ML model

Modeling with Extracted Features from EDA

Drop str column

Correlation Plot

Top correlation with Gross_worldwide

Linear Regression

Mean MAE of test set with 1000 loops is 52M$ ~ 1200 Tỷ VND

Mean MAE of test set with 1000 loops is 46M$ ~ 1050 Tỷ VND

Random Forest

Mean MAE of test 38M$ ~ 870 Tỷ Vietnam Dong

Modeling with Extracted Features in Training

Initialization

Linear Regression

Since it takes time to extract feature while running so we run only 10 time.

The result is MAE on test is 48M $

Now we will see Linear Regression give how much coeficient on Data

Now conduct a coefficient table for each attributes to see what are the best predictors

Random Forest Regressor

We run only 1 time with 1000 estimators.

The result is MAE on test is 43M $