INDEX
Explanations
phrases related to unsupported or unproven claims
words and phrases that indicate claims or concerns that are lacking in evidence or validity
Negative Logits
ebus         -0.96
azine        -0.84
itte         -0.81
ophon        -0.79
hedral       -0.76
odan         -0.75
emis         -0.72
lite         -0.71
reon         -0.70
igslist      -0.69
Positive Logits
unfounded    0.88
optimism     0.85
nesses       0.81
belief       0.81
indignation  0.79
suspicions   0.79
speculation  0.77
accusations  0.77
curiosity    0.77
outrage      0.77
Activations Density: 0.018%