INDEX
Explanations
pronouns and their variations
New Auto-Interp
Negative Logits
ikt
-0.17
leck
-0.16
td
-0.15
s
-0.15
amp
-0.14
eyes
-0.13
ibase
-0.13
sian
-0.13
acent
-0.13
eldorf
-0.13
POSITIVE LOGITS
647
0.16
Cust
0.15
Powell
0.15
bsites
0.15
hausen
0.14
cust
0.14
powder
0.14
ève
0.14
Latch
0.14
eward
0.14
Activations Density 0.030%