INDEX
Explanations
the word "Popular."
instances of the word "popular" and its variations
New Auto-Interp
Negative Logits
thur
-0.84
Aviv
-0.77
aul
-0.72
ural
-0.67
thy
-0.67
xual
-0.66
uth
-0.66
aca
-0.65
Kear
-0.64
ermott
-0.63
POSITIVE LOGITS
ity
0.98
Popular
0.93
popular
0.88
ized
0.80
itarian
0.79
iatus
0.74
popular
0.71
majorities
0.71
izing
0.70
enterprises
0.70
Activations Density 0.013%