Explanations
phrases indicating important events or moments in time
instances of the word "when."
Negative Logits
ertain      -0.76
whatever    -0.70
plus        -0.67
ggles       -0.66
vre         -0.66
bear        -0.65
augh        -0.64
agin        -0.63
ilus        -0.63
Grade       -0.62
Positive Logits
soever      1.27
they        0.88
upon        0.84
confronted  0.77
hackers     0.75
someone     0.73
hordes      0.73
suddenly    0.72
he          0.70
gunmen      0.69
Activations Density: 0.106%
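
A density figure like the one above is usually read as the share of sampled tokens on which the feature fires. The sketch below illustrates that reading only; the function name `activation_density`, the zero threshold, and the sample data are all assumptions for illustration and do not come from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    `activations` is a hypothetical flat list of per-token activation
    values for one feature; the threshold of 0.0 is an assumption.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up data: the feature fires on 1 token out of 1000.
sample = [0.0] * 999 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # formats the fraction as a percentage
```

Under this reading, a density of 0.106% would correspond to roughly one active token in every 940 or so.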