INDEX
Explanations
references to political figures and events
New Auto-Interp
Negative Logits
ibly
-0.85
ipel
-0.80
ĨĴ
-0.76
iors
-0.74
phot
-0.71
ONSORED
-0.69
ozo
-0.68
photos
-0.66
asaki
-0.65
andowski
-0.64
POSITIVE LOGITS
mortar
0.74
arrows
0.74
bolts
0.70
Betsy
0.69
arrow
0.68
gown
0.67
heels
0.66
parcel
0.65
potatoes
0.65
Sund
0.64
Activations Density 1.125%