INDEX
Explanations
terms related to historical events or references
mentions of "history" and related concepts
New Auto-Interp
Negative Logits
igans
-0.70
vae
-0.68
oner
-0.67
autions
-0.66
enger
-0.66
leased
-0.63
ECA
-0.63
aton
-0.62
orc
-0.62
weeney
-0.62
POSITIVE LOGITS
buffs
1.12
textbooks
1.09
lesson
0.98
books
0.96
repeats
0.92
repeating
0.88
orians
0.82
documentaries
0.82
making
0.79
museums
0.77
Activations Density 0.052%