INDEX
Explanations
words related to specific names or persons
the presence of the name "Al" in various contexts
New Auto-Interp
Negative Logits
disappearing
-0.67
membership
-0.64
ãģ¯
-0.63
operating
-0.62
underwater
-0.62
transporting
-0.61
amounts
-0.60
soaring
-0.60
surging
-0.59
opaque
-0.59
POSITIVE LOGITS
mberg
0.96
antz
0.93
ickson
0.92
bath
0.91
bre
0.91
bery
0.90
stadt
0.88
verson
0.86
jac
0.85
zman
0.85
Activations Density 0.122%