INDEX
Explanations
proper nouns
the repeated mention of the name "Da" in various contexts
New Auto-Interp
Negative Logits
sburgh
-1.15
sburg
-0.82
ãĤ¡
-0.80
eele
-0.73
ï¸ı
-0.72
guiActiveUnfocused
-0.71
é¾įå¥ij士
-0.70
LESS
-0.70
ments
-0.70
eering
-0.69
POSITIVE LOGITS
emon
1.11
emonic
1.05
isy
1.03
ft
0.97
uman
0.91
uthor
0.86
cha
0.86
iba
0.85
fts
0.85
plin
0.85
Activations Density 0.008%