INDEX
Explanations
references to specific years in the 19th and 20th centuries
references to the 19th century and its events
New Auto-Interp
Negative Logits
aminer
-0.76
pora
-0.70
orc
-0.70
anguage
-0.69
ité
-0.68
heed
-0.66
afort
-0.64
enture
-0.63
anie
-0.62
backfield
-0.61
POSITIVE LOGITS
âĸĪâĸĪ
1.09
th
1.01
61
0.91
08
0.89
07
0.88
05
0.88
03
0.88
09
0.86
06
0.85
04
0.84
Activations Density 0.039%