keras model evaluate. are you making this mistake when implementing the macro f1 score in keras exploratory data analysis scores confusion matrix