INDEX
Explanations
references to individuals or groups with specific attributes or roles
occurrences of the word "ards."
New Auto-Interp
Negative Logits
ONSORED
-0.70
ĸļ
-0.65
ĺħ
-0.65
gd
-0.65
emb
-0.62
Podesta
-0.61
wei
-0.61
trak
-0.60
amen
-0.57
mington
-0.57
POSITIVE LOGITS
ards
1.12
hips
0.94
ragon
0.89
ystem
0.86
hire
0.86
creen
0.86
inas
0.83
hip
0.82
ard
0.80
nesses
0.79
Activations Density 0.012%