INDEX
Negative Logits
The
0.43
Accuracy
0.42
liczba
0.42
ECTION
0.41
ruzione
0.40
ห
0.40
그러나
0.39
وكان
0.39
ත
0.39
CONF
0.38
POSITIVE LOGITS
extensively
0.63
actively
0.57
적극
0.57
積極的に
0.55
proactively
0.54
consciously
0.52
активно
0.52
intentionally
0.50
প্রতিশ্রুতি
0.47
frequently
0.46
Activations Density 0.012%