Is there a way to print the confusion matrix #72

Open
rsuwaileh opened this issue Mar 17, 2021 · 5 comments

Comments

@rsuwaileh commented Mar 17, 2021

Hey,

I want to print the FP and FN for my system. I checked the code and it seems you don't use them in the calculation and just use pred_sum and true_sum. Is there an easy way to get these numbers?

Thanks!

@rsuwaileh (Author) commented Mar 17, 2021

I just found this answer. However, this seems to be computed on the token level. Is there a way to get the confusion matrix on the entity level?

In the example in the code, you show these numbers:

    Example:
        >>> from seqeval.metrics import performance_measure
        >>> y_true = [['O', 'O', 'O', 'B-MISC', 'I-MISC', 'O', 'B-ORG'], ['B-PER', 'I-PER', 'O']]
        >>> y_pred = [['O', 'O', 'B-MISC', 'I-MISC', 'I-MISC', 'O', 'O'], ['B-PER', 'I-PER', 'O']]
        >>> performance_measure(y_true, y_pred)
        (3, 3, 1, 4)

But when I run it, I get the following numbers:

from seqeval.metrics import performance_measure
y_true = [['O', 'O', 'O', 'B-MISC', 'I-MISC', 'O', 'B-ORG'], ['B-PER', 'I-PER', 'O']]
y_pred = [['O', 'O', 'B-MISC', 'I-MISC', 'I-MISC', 'O', 'O'], ['B-PER', 'I-PER', 'O']]
performance_measure(y_true, y_pred)
{'TP': 3, 'FP': 2, 'FN': 1, 'TN': 4}

If it's token level, then it should be:
{'TP': 4, 'FP': 1, 'FN': 1, 'TN': 4}
If it's entity level, then it should be:
{'TP': 1, 'FP': ??, 'FN': 1, 'TN': 4}

Can you explain these numbers?
How is a partial match handled?
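(Not an official seqeval API, just a minimal sketch: performance_measure appears to compare tags token by token, so for entity-level TP/FP/FN one option is to build the spans yourself with seqeval's get_entities helper and intersect them. An entity-level TN is not really well defined, which is probably why the library does not report one.)

# Minimal sketch (not an official seqeval API): entity-level TP/FP/FN with
# exact-match semantics, built from seqeval's get_entities helper.
from seqeval.metrics.sequence_labeling import get_entities

y_true = [['O', 'O', 'O', 'B-MISC', 'I-MISC', 'O', 'B-ORG'], ['B-PER', 'I-PER', 'O']]
y_pred = [['O', 'O', 'B-MISC', 'I-MISC', 'I-MISC', 'O', 'O'], ['B-PER', 'I-PER', 'O']]

# get_entities flattens the nested lists and returns (type, start, end) spans.
true_entities = set(get_entities(y_true))
pred_entities = set(get_entities(y_pred))

tp = len(true_entities & pred_entities)  # spans matching in type and boundaries
fp = len(pred_entities - true_entities)  # predicted spans with no exact match
fn = len(true_entities - pred_entities)  # gold spans that were missed entirely

print({'TP': tp, 'FP': fp, 'FN': fn})
# For the sequences above this prints {'TP': 1, 'FP': 1, 'FN': 2}:
# only PER matches exactly, the partially overlapping MISC counts as one FP
# plus one FN, and the missed ORG is another FN.

Under this exact-match convention a partial overlap is penalized twice, once as a false positive and once as a false negative, which is also how the default entity-level precision and recall treat it.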

@rsuwaileh reopened this Mar 17, 2021
@mustfkeskin

I have the same question: how can we calculate a confusion matrix using the seqeval library?

@mirfan899

I have the same question. I am working on token classification and the results are confusing:

{'eval_loss': 1.503118872642517, 'eval_precision': 0.2734958710184821, 'eval_recall': 0.16045680009228286, 'eval_f1': 0.20225372591784804, 'eval_accuracy': 0.8713822804442352, 'eval_runtime': 73.1268, 'eval_samples_per_second': 59.937, 'epoch': 17.0}

The eval accuracy is high while precision, recall, and F1 are very low. It seems there might be a bug related to computing the scores at the entity level.

@zingxy commented Sep 30, 2021

@mirfan899 it's just normal: in token classification the number of O labels is much higher than the number of B labels.

@JanRodriguez commented Aug 4, 2023

To complement what @zingxy said, accuracy is just "of all tokens, how many did I guess right?", with the O class included. This makes it easy to reach high or very high accuracies, since most tokens will usually be O.

On the other hand, the F1 score reported here is the micro average over the entity classes, without taking the O class into account. Check the numbers in the classification report.
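As a quick illustration (a minimal sketch reusing the toy sequences from the earlier comments), seqeval's accuracy_score is token-level with O included, while f1_score and classification_report are entity-level:

from seqeval.metrics import accuracy_score, classification_report, f1_score

y_true = [['O', 'O', 'O', 'B-MISC', 'I-MISC', 'O', 'B-ORG'], ['B-PER', 'I-PER', 'O']]
y_pred = [['O', 'O', 'B-MISC', 'I-MISC', 'I-MISC', 'O', 'O'], ['B-PER', 'I-PER', 'O']]

print(accuracy_score(y_true, y_pred))  # token-level, O included: 0.7 (7 of 10 tags correct)
print(f1_score(y_true, y_pred))        # entity-level micro F1: 0.4 (1 exact match, 2 predicted, 3 gold)
print(classification_report(y_true, y_pred))  # per-entity-type precision/recall/F1, O excluded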
