INDEX
Explanations
XML attributes and their values
New Auto-Interp
Negative Logits
651
-0.15
esk
-0.15
orld
-0.14
çij
-0.14
Ãĺ
-0.14
919
-0.14
itals
-0.14
traits
-0.14
onis
-0.14
kit
-0.14
POSITIVE LOGITS
znik
0.16
PD
0.15
-conscious
0.14
icina
0.14
ermann
0.14
ldr
0.14
貸
0.14
atura
0.14
lices
0.14
ábado
0.13
Activations Density 0.002%