INDEX
Explanations
actions and interactions between characters in the narrative
New Auto-Interp
Negative Logits
raj
-0.15
ling
-0.15
itag
-0.14
ifdef
-0.14
ynom
-0.14
sentiment
-0.13
ynet
-0.13
ynch
-0.13
aina
-0.13
Callable
-0.13
POSITIVE LOGITS
hetto
0.17
echa
0.16
egrator
0.15
avaÅŁ
0.15
.rs
0.15
ieri
0.15
ugins
0.14
kili
0.14
lsi
0.14
obby
0.14
Activations Density 0.700%