INDEX
Explanations
names of people, particularly those with last names commonly associated with color (e.g., White, Black, Green)
the last names of notable individuals
Negative Logits
VIDE          -0.68
srfAttach     -0.66
polarization  -0.62
psi           -0.61
Haram         -0.60
Serie         -0.59
BOX           -0.59
Hydra         -0.58
Pastebin      -0.58
masked        -0.58
Positive Logits
stein         1.36
berger        1.34
croft         1.34
gren          1.32
cott          1.32
hill          1.30
worth         1.29
baum          1.29
field         1.28
burn          1.27
Activations Density 0.154%