INDEX
Explanations
mentions of time durations, specifically focusing on years and decades
New Auto-Interp
Negative Logits
edIn
-0.65
este
-0.65
ibrary
-0.65
¬¼
-0.62
subp
-0.61
anooga
-0.61
Username
-0.59
Ft
-0.59
Ĥª
-0.58
Els
-0.56
POSITIVE LOGITS
2020
0.87
2021
0.86
iband
0.82
someday
0.81
izons
0.80
2019
0.80
olds
0.80
hereafter
0.79
glass
0.77
unless
0.75
Activations Density 0.071%