INDEX
Explanations
instances of personal pronouns and references to self
New Auto-Interp
Negative Logits
295
-0.15
ález
-0.14
enga
-0.14
ãģĭãģĹ
-0.14
Floors
-0.14
227
-0.13
ansi
-0.13
اÙĬØ´
-0.13
Moms
-0.13
young
-0.13
POSITIVE LOGITS
gni
0.18
itur
0.17
itor
0.16
PECT
0.15
oji
0.15
abych
0.15
aday
0.15
dorf
0.15
utz
0.15
epar
0.15
Activations Density 0.194%