INDEX
Explanations
proper nouns related to people's names, specifically focusing on "Mam" and "Glasziou"
mentions of specific individuals or names
New Auto-Interp
Negative Logits
kson
-0.71
orld
-0.69
diarr
-0.67
wrapper
-0.66
¯¯¯¯
-0.64
culosis
-0.62
eals
-0.59
ciating
-0.59
personalized
-0.59
ighth
-0.59
POSITIVE LOGITS
Mam
1.18
umin
0.91
amia
0.90
uz
0.85
lu
0.80
ueller
0.79
udeau
0.78
matical
0.76
wit
0.76
uj
0.75
Activations Density 0.013%