INDEX
Explanations
expressions of emotional responses or reactions to experiences
past tense evaluations of quality
New Auto-Interp
Negative Logits
fjspx
-0.45
Autowired
-0.40
timi
-0.39
zeiro
-0.37
anillo
-0.36
miroir
-0.36
છે
-0.34
kia
-0.34
وي
-0.34
ts
-0.33
POSITIVE LOGITS
was
0.84
было
0.71
było
0.68
wasnt
0.65
était
0.65
wasn
0.64
была
0.61
وكان
0.61
הייתה
0.60
była
0.58
Activations Density 0.242%