INDEX
Explanations
names of notable personalities, such as "DeVos" and "Dawkins."
mentions of specific individuals, particularly Betsy DeVos and Richard Dawkins
New Auto-Interp
Negative Logits
Polic
-0.79
Fant
-0.78
Leth
-0.71
Pike
-0.70
pneum
-0.68
Lith
-0.66
buoy
-0.65
Antar
-0.64
fishermen
-0.64
ppo
-0.62
POSITIVE LOGITS
liga
0.90
æł
0.86
etics
0.84
à¼
0.84
heimer
0.83
chool
0.81
ãĥ¼ãĥĨãĤ£
0.80
ulously
0.79
verse
0.76
hyde
0.75
Activations Density 0.021%