INDEX
Explanations
phrases related to future plans or actions
instances of the word "will" indicating future actions or outcomes
New Auto-Interp
Negative Logits
Syndrome
-0.66
Reporting
-0.66
amped
-0.66
Writing
-0.65
76561
-0.63
ukemia
-0.62
Mom
-0.61
Fever
-0.61
Shooting
-0.61
Pie
-0.59
POSITIVE LOGITS
likely
1.08
doubtless
1.07
undoubtedly
1.04
be
1.03
inevitably
0.99
soon
0.98
hopefully
0.98
probably
0.97
continue
0.97
eventually
0.96
Activations Density 0.201%