INDEX
Explanations
phrases related to historical events, political movements, and influential figures
instances of the word "great" in various contexts
Negative Logits
ople      -0.90
chedel    -0.73
Leilan    -0.73
eters     -0.70
clips     -0.69
utical    -0.68
©¶æ       -0.68
meta      -0.66
ilus      -0.66
ratom     -0.65
Positive Logits
sword        1.03
strides      0.95
itud         0.84
apes         0.76
reek         0.76
pains        0.75
Dane         0.74
proportions  0.74
misfortune   0.73
axe          0.71
Activations Density 0.036%