INDEX
Explanations
significant actions and relationships in a narrative context
New Auto-Interp
Negative Logits
arrant
-0.18
_soc
-0.15
errer
-0.14
æ°
-0.14
verm
-0.14
.RESULT
-0.14
edor
-0.14
ssp
-0.14
nÃło
-0.14
quat
-0.14
POSITIVE LOGITS
Tup
0.17
alach
0.16
Grove
0.15
oes
0.15
COP
0.15
ipers
0.14
dent
0.14
Cop
0.14
rio
0.14
ê³µ
0.14
Activations Density 0.039%