INDEX
Explanations
numeric references to the 19th century
references to the 19th century
New Auto-Interp
Negative Logits
aminer
-0.87
orc
-0.82
ellation
-0.79
plom
-0.78
qqa
-0.76
ensional
-0.75
igent
-0.75
plings
-0.74
endum
-0.72
ipment
-0.72
POSITIVE LOGITS
th
1.09
âĸĪâĸĪ
1.00
09
0.93
07
0.90
05
0.90
61
0.89
08
0.89
03
0.85
87
0.85
04
0.85
Activations Density 0.020%