INDEX
Explanations
negative sentiments or problem-related phrases
Negative Logits
Token               Logit
Autoritní           -0.49
すべての            -0.42
таратура            -0.41
رشف                 -0.41
全ての              -0.41
(no token shown)    -0.40
KURZBESCHREIBUNG    -0.38
nonché              -0.37
kmal                -0.35
tagPool             -0.35
Positive Logits
Token       Logit
either      4.75
either      4.22
Either      4.16
Either      4.09
entweder    3.48
要么        2.77
enten       2.48
либо        2.44
ITHER       2.42
soit        2.03
Activations Density 1.862%
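
The density figure above can be read as a simple ratio. The sketch below is illustrative only: it assumes that "activations density" means the share of sampled token positions on which the feature fires above zero, which the page itself does not state. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (# active positions) / (# sampled positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy sample: 1000 token positions, 19 of them active.
sample = [0.0] * 981 + [0.5] * 19
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that reading, a density of 1.862% would mean the feature is active on roughly 19 of every 1000 sampled tokens.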