Explanations
names of individuals or characters
proper names and references related to sports and notable individuals
Negative Logits
esis (-0.79), houses (-0.78), edly (-0.73), house (-0.72), edIn (-0.72),
lein (-0.71), holes (-0.70), cliffe (-0.69), laughs (-0.68), hips (-0.68)
Positive Logits
OPLE (0.84), pty (0.82), amina (0.82), uations (0.79), Fey (0.74),
wark (0.73), ```` (0.71), vier (0.71), Anim (0.69), cture (0.69)
Activations Density 0.124%