Explanations
expressions of indifference or lack of concern in various contexts
Negative Logits
Token          Logit
bhan           -0.62
τερα           -0.57
متحده          -0.55
timewa         -0.53
restriction    -0.51
rungsseite     -0.50
zubauen        -0.50
umina          -0.50
Ban            -0.50
restricted     -0.49
Positive Logits
Token          Logit
disregard      0.93
indifference   0.90
nonchal        0.85
ignoring       0.79
indifferent    0.77
ignore         0.77
Ignoring       0.76
apathy         0.75
shrug          0.75
Ignoring       0.74
Activations Density: 0.397%