INDEX
Explanations
statements regarding existence or being in various states
New Auto-Interp
Negative Logits
äter
-0.15
ENÃį
-0.15
asic
-0.14
aris
-0.14
arness
-0.14
TriState
-0.14
.TO
-0.14
renom
-0.14
899
-0.14
sido
-0.14
POSITIVE LOGITS
able
0.30
unable
0.23
revealed
0.21
born
0.18
Unable
0.18
replaced
0.18
transformed
0.17
Able
0.17
activated
0.17
unable
0.17
Activations Density 0.425%