Explanations
phrases indicating apathy or detachment from social issues
Negative Logits
Token            Logit
XmlAccessType    -0.78
WriteBarrier     -0.77
onOptions        -0.58
Vikipedi         -0.56
vocales          -0.56
tır              -0.53
ثيق              -0.53
dAtA             -0.52
RuleContext      -0.52
متعلقه           -0.52
Positive Logits
Token            Logit
nonchal          0.91
shrug            0.82
calmly           0.82
indifference     0.81
indiffer         0.80
indifferent      0.77
indifer          0.73
余裕             0.69
calmness         0.67
shrugs           0.65
Activations Density 0.087%
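
A density figure like the one above is commonly read as the fraction of token positions on which the feature fires. The sketch below shows that calculation under assumptions that are not stated on this page: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, and the dashboard's actual method may differ.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 87 of 100,000 token positions.
sample = [1.5] * 87 + [0.0] * (100_000 - 87)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.087%"
```

A density this low suggests the feature is sparse: it fires on fewer than one token in a thousand.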