INDEX
Explanations
phrases comparing the past to the present or future
Negative Logits
Token       Logit
ographies   -0.59
eye         -0.58
Pak         -0.56
rich        -0.54
jee         -0.54
chin        -0.53
jiang       -0.53
oris        -0.53
folios      -0.51
letters     -0.51
Positive Logits
Token       Logit
earlier     1.18
yesterday   1.12
before      1.09
prior       1.05
previously  1.04
beforehand  0.95
originally  0.92
during      0.91
BEFORE      0.87
ufact       0.85
Activations Density: 0.184%