INDEX
Explanations
phrases indicating the timing of events
instances of the word "Earlier" to indicate time references
Negative Logits
peace        -0.68
Oracle       -0.67
Sov          -0.67
Bloom        -0.67
confidence   -0.66
Chance       -0.66
IRC          -0.65
NW           -0.65
observers    -0.64
caution      -0.64
Positive Logits
Though       1.87
Previously   1.87
Currently    1.80
Several      1.79
Despite      1.78
Originally   1.77
Although     1.77
Already      1.76
Earlier      1.74
Since        1.70
Activations Density: 0.129%