INDEX
Explanations
keywords related to news reporting and social media interactions
expressions related to reporting or disclosing information
New Auto-Interp
Negative Logits
ngth
-0.87
umerous
-0.81
ĸļ
-0.80
terness
-0.76
foremost
-0.75
teasp
-0.71
alyst
-0.70
ridor
-0.70
allion
-0.69
achus
-0.69
POSITIVE LOGITS
anything
1.01
something
0.99
stuff
0.98
GMOs
0.95
abortions
0.93
things
0.89
THEIR
0.88
abortion
0.87
boobs
0.85
contraceptives
0.85
Activations Density 0.546%