INDEX
Explanations
discussions related to political and social power dynamics
New Auto-Interp
Negative Logits
cheng
-0.13
ä½³
-0.13
ä¼ĺ
-0.13
paci
-0.13
velopment
-0.12
ux
-0.12
Registrar
-0.12
ождениÑı
-0.12
ç³ĸ
-0.12
lạc
-0.12
POSITIVE LOGITS
Conspiracy
0.28
conspiracy
0.27
Bilder
0.25
Truth
0.25
UFO
0.24
MSM
0.24
conspir
0.24
truth
0.24
Roths
0.23
911
0.23
Activations Density 0.479%