INDEX
Explanations
email addresses, specifically those containing the sequence "podesta"
the presence of proper nouns or names related to individuals, specifically those that are commonly associated with media or public figures
New Auto-Interp
Negative Logits
captcha
-0.65
neutrality
-0.65
lication
-0.64
pelled
-0.64
liners
-0.63
rified
-0.63
Boolean
-0.62
existential
-0.62
drawn
-0.62
sided
-0.60
POSITIVE LOGITS
esta
1.36
ieri
0.87
ãĥ¼ãĥĨãĤ£
0.83
Luxem
0.83
UTION
0.79
utive
0.78
ppo
0.77
ption
0.77
zza
0.76
ERT
0.76
Activations Density 0.015%