INDEX
Explanations
years or numerical values related to specific events or entities
numerical values related to years or statistical data
Negative Logits
raltar    -0.83
ledged    -0.81
otle      -0.80
orius     -0.79
sonian    -0.77
awed      -0.76
urger     -0.76
intosh    -0.75
amaru     -0.73
fall      -0.73
Positive Logits
nd        1.77
ND        0.93
80        0.82
thirds    0.81
naire     0.80
50        0.78
ipop      0.77
entary    0.75
ength     0.75
mph       0.74
Activations Density: 0.098%