Explanations
dates or time periods
temporal references related to starting points in time
Negative Logits
abled       -0.76
namese      -0.74
buster      -0.73
ken         -0.71
soType      -0.71
ascript     -0.71
osponsors   -0.65
Names       -0.64
erness      -0.63
luent       -0.63
Positive Logits
earnest     0.84
dusk        0.66
anew        0.65
takeoff     0.64
here        0.63
daylight    0.63
McMaster    0.62
Gutenberg   0.61
prelim      0.61
sporadic    0.59
Activations Density: 0.158%