INDEX
Explanations
references to personal ownership or possession
New Auto-Interp
Negative Logits
rang
-0.15
nya
-0.15
mav
-0.15
.addProperty
-0.14
mund
-0.14
nable
-0.14
s
-0.14
ONS
-0.13
ivec
-0.13
gether
-0.13
POSITIVE LOGITS
anmar
0.23
rtle
0.20
opia
0.20
SELF
0.19
embros
0.18
adows
0.17
edom
0.16
zell
0.16
zelf
0.16
opic
0.16
Activations Density 0.128%