In NLP, which metrics are commonly used to evaluate model performance?
F1-score
Mean Squared Error
BLEU

Linguistics Exercises are loading ...