Explanations
phrases related to unexpected events or plot twists
phrases indicating a change or twist in events
Negative Logits
Token         Logit
VICE          -0.76
lain          -0.74
uncond        -0.69
leground      -0.67
Demand        -0.64
mobi          -0.64
ials          -0.62
ļéĨĴ          -0.60
Consumers     -0.59
Parenthood    -0.59
Positive Logits
Token         Logit
phrase        0.83
phrase        0.83
fortune       0.81
tides         0.73
hindsight     0.73
fate          0.72
dawn          0.71
fortunes      0.70
fortune       0.70
midnight      0.68
Activations Density 0.105%