INDEX
Explanations
mentions of future actions
instances of the word "will" indicating future events or actions
New Auto-Interp
Negative Logits
reality
-0.65
abstraction
-0.62
amped
-0.62
Lear
-0.62
Mom
-0.61
ZI
-0.60
processing
-0.60
76561
-0.59
establishment
-0.59
engineering
-0.59
POSITIVE LOGITS
be
1.19
continue
1.05
undoubtedly
1.05
doubtless
1.04
likely
1.01
surely
0.97
gladly
0.95
probably
0.93
remain
0.93
definitely
0.92
Activations Density 0.205%