INDEX
Explanations
instances of doubt and references to emotional states or relationships
New Auto-Interp
Negative Logits
AccessorTable
-0.89
')['
-0.77
__":
-0.69
AutoresizingMask
-0.68
makeConstraints
-0.60
alnız
-0.59
avía
-0.59
rašymas
-0.57
พาะ
-0.57
stdc
-0.57
POSITIVE LOGITS
never
1.12
never
1.12
Never
1.11
jamás
1.11
NEVER
1.10
Never
1.10
jamais
1.04
NEVER
1.03
nunca
0.96
nooit
0.90
Activations Density 0.122%