INDEX
Explanations
phrases indicating passage of time over a period of years
occurrences of the word "the"
Negative Logits
uala       -0.77
fy         -0.72
ulhu       -0.70
vu         -0.69
RIC        -0.65
Behind     -0.65
Boo        -0.65
Bio        -0.64
nces       -0.64
Bu         -0.63
Positive Logits
weekend    0.99
horizon    0.88
holidays   0.87
course     0.84
ensuing    0.84
span       0.81
whole      0.80
hang       0.80
arching    0.79
threshold  0.79
Activations Density 0.083%
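
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes that "activation density" means the fraction of sampled token positions on which the feature's activation is above zero; the page does not define the term, so that reading is an assumption. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (number of active token positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 5 active positions out of 6000 tokens.
sample = [0.0] * 5995 + [1.2, 0.4, 3.1, 0.9, 2.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.083%
```

Under this reading, a density of 0.083% corresponds to the feature firing on roughly 1 in 1,200 tokens.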