INDEX
Explanations
words related to conspiracy theories
negative contractions indicating negation or denial
New Auto-Interp
Negative Logits
looms
-0.74
liaison
-0.65
initiative
-0.63
circulation
-0.63
successes
-0.62
decimal
-0.62
imitation
-0.61
promoters
-0.61
accum
-0.61
Mirage
-0.61
POSITIVE LOGITS
ï¸ı
1.10
Balt
0.89
ï¸
0.88
İ
0.86
âĢł
0.83
sure
0.81
ç
0.81
sorry
0.81
culus
0.78
yet
0.78
Activations Density 0.163%