INDEX
Explanations
the last names of famous or notable individuals
proper nouns and elements related to celebrity or notable figures
New Auto-Interp
Negative Logits
ashtra
-0.85
ulhu
-0.83
auga
-0.83
sels
-0.83
urat
-0.82
lehem
-0.79
asar
-0.74
ersen
-0.74
iov
-0.72
awed
-0.72
POSITIVE LOGITS
customs
0.62
McKay
0.61
herb
0.58
Shotgun
0.58
tackling
0.58
pickup
0.57
toile
0.57
shotgun
0.55
cab
0.55
sprung
0.55
Activations Density 0.340%