Explanations
references to the collective perspective or shared experience
Negative Logits
Token        Logit
themſelves   -0.68
myſelf       -0.66
DBNull       -0.65
raiſ         -0.65
himſelf      -0.64
chofe        -0.63
Jefus        -0.61
cdti         -0.60
preghiera    -0.59
Efq          -0.59
Positive Logits
Token        Logit
pem          0.59
Ros          0.58
entown       0.55
محفوظة       0.54
Datuak       0.53
cessed       0.53
favourable   0.52
kuuta        0.52
__':         0.52
Coyle        0.52
Activations Density: 0.044%