INDEX
Explanations
words related to a specific term "Snopes"
the presence of a specific name or label repeatedly mentioned in the text
New Auto-Interp
Negative Logits
PowerPoint
-0.76
Hezbollah
-0.75
TAIN
-0.75
66666666
-0.74
xual
-0.73
heid
-0.72
Sunni
-0.71
upon
-0.68
ãĥĩ
-0.66
CPI
-0.64
POSITIVE LOGITS
ipers
1.06
uggle
1.05
ugg
1.04
Sn
1.04
icket
1.04
agging
1.02
igger
1.00
omore
0.98
utt
0.98
agged
0.96
Activations Density 0.006%