INDEX
Explanations
words related to history or past events
phrases that indicate historical context
New Auto-Interp
Negative Logits
Sahara
-0.78
mails
-0.71
him
-0.70
plan
-0.70
Pages
-0.68
gur
-0.68
ger
-0.68
Eva
-0.67
itas
-0.66
ity
-0.65
POSITIVE LOGITS
housed
0.95
conduc
0.86
dexter
0.86
wielded
0.84
exting
0.83
represented
0.83
entimes
0.82
speaking
0.81
©¶æ
0.81
conclud
0.80
Activations Density 0.019%