INDEX
Explanations
dates represented in a specific format, possibly related to historical events
numerical data and statistics related to specific years
New Auto-Interp
Negative Logits
pora
-0.75
aminer
-0.73
heed
-0.70
iants
-0.69
ogical
-0.69
anguage
-0.68
backfield
-0.66
arious
-0.66
ité
-0.65
enture
-0.65
POSITIVE LOGITS
âĸĪâĸĪ
1.03
61
0.93
08
0.92
07
0.89
09
0.88
05
0.88
06
0.88
04
0.86
03
0.86
th
0.86
Activations Density 0.040%