Negative Logits
Token        Logit
Pretty       0.81
faciliter    0.78
Setup        0.75
सश           0.74
handy        0.73
facilite     0.70
Pretty       0.70
Ready        0.67
utili        0.67
Functions    0.67
Positive Logits
Token        Logit
correctness  0.91
Mathematics  0.90
accuracy     0.84
corrected    0.82
brit         0.79
english      0.77
Mathematics  0.76
british      0.75
surpasses    0.75
corrected    0.74
Activations Density: 0.127%