INDEX
Explanations
mentions of unexpected plot twists in textual content
occurrences of the word "twist" in various contexts
New Auto-Interp
Negative Logits
league
-0.74
ufact
-0.64
Ducks
-0.64
hammad
-0.63
inately
-0.63
Male
-0.63
particip
-0.62
rahim
-0.62
asking
-0.61
estine
-0.61
POSITIVE LOGITS
twist
1.24
twists
1.14
Twist
0.91
endings
0.84
twisting
0.83
Whedon
0.75
stroke
0.73
weave
0.73
abouts
0.72
bend
0.72
Activations Density 0.014%