Explanations
proper names like "Boyle" and "Higgins"
proper nouns, particularly names and locations
Negative Logits
Token       Logit
aneous      -0.87
ilage       -0.82
hra         -0.79
onym        -0.78
NAS         -0.77
drawer      -0.74
azeera      -0.73
ublic       -0.72
onyms       -0.71
elo         -0.71
Positive Logits
Token       Logit
ynski       0.86
Higgins     0.79
Remastered  0.78
ï¸ı         0.77
yip         0.75
abad        0.74
nodd        0.74
shed        0.74
ongyang     0.73
s           0.72
Activations Density: 0.024%