INDEX
Explanations
unexpected or surprising events or developments
references to unexpected changes or turns in narrative or situations
Negative Logits
pta      -0.77
ufact    -0.76
ogl      -0.73
Ducks    -0.69
Domain   -0.66
encia    -0.64
och      -0.63
inez     -0.61
arel     -0.61
EXP      -0.61
Positive Logits
twist    1.20
twists   1.05
abouts   0.84
Twist    0.80
angle    0.77
twisted  0.76
Whedon   0.75
hered    0.74
angle    0.74
endings  0.73
Activations Density: 0.081%