INDEX
Explanations
proper nouns related to individuals or organizations
the repeated mention of the name "Da" across various contexts
New Auto-Interp
Negative Logits
sburgh
-1.16
ãĤ¡
-0.86
ï¸ı
-0.78
sburg
-0.77
é¾įå¥ij士
-0.75
ORED
-0.74
guiActiveUnfocused
-0.73
LESS
-0.72
eering
-0.71
eele
-0.70
POSITIVE LOGITS
isy
1.03
emon
1.02
ft
0.94
emonic
0.88
uman
0.85
cha
0.84
iba
0.84
qu
0.83
Da
0.81
fts
0.81
Activations Density 0.004%