INDEX
Explanations
references to self-directed actions or self-awareness
New Auto-Interp
Negative Logits
self
-0.79
Self
-0.69
SELF
-0.63
Self
-0.61
itself
-0.57
self
-0.55
Selbst
-0.53
zelf
-0.51
Selbst
-0.50
riwal
-0.47
POSITIVE LOGITS
doInBackground
0.88
}}"></
0.84
,
0.79
ویکیپدیای
0.75
']?>
0.73
Collegamenti
0.72
)}</
0.72
endphp
0.71
مزید
0.71
})()
0.70
Activations Density 0.011%