INDEX
Explanations
specific years or date ranges
years and dates related to events or durations of time
New Auto-Interp
Negative Logits
elight
-0.66
advant
-0.63
Tweet
-0.61
redients
-0.61
izers
-0.60
ication
-0.58
nce
-0.57
abet
-0.57
Sovere
-0.56
tuber
-0.55
POSITIVE LOGITS
respectively
1.13
inclusive
0.95
depending
0.93
periods
0.83
depending
0.83
totaling
0.80
onwards
0.75
timeframe
0.70
alike
0.69
averaged
0.68
Activations Density 0.088%