INDEX
Explanations
first-person statements and expressions of identity
Negative Logits
Token      Logit
atsu       -0.16
inger      -0.15
pig        -0.15
nger       -0.15
activex    -0.15
issant     -0.14
sworth     -0.14
dac        -0.14
INGER      -0.14
アル       -0.14
Positive Logits
Token      Logit
chalk      0.18
529        0.15
vement     0.15
ikat       0.15
Prec       0.14
Dillon     0.14
шки        0.14
Walls      0.14
Maver      0.14
Fine       0.13
Activations Density: 0.254%