INDEX
Explanations
proper nouns related to people's names
repeated mentions of specific names, particularly "Jacobs" and "Brun."
New Auto-Interp
Negative Logits
mble
-0.95
aceutical
-0.85
atively
-0.73
ivid
-0.71
ocally
-0.69
raq
-0.67
umin
-0.67
ograp
-0.67
ãĥī
-0.66
Warfare
-0.66
POSITIVE LOGITS
Brun
0.82
ton
0.76
tones
0.70
hou
0.69
ernaut
0.69
son
0.69
aign
0.68
Jacobs
0.68
mand
0.68
ette
0.68
Activations Density 0.028%