INDEX
Explanations
phrases indicating the general truth or validity of a statement
statements affirming or asserting truth
New Auto-Interp
Negative Logits
Pages
-0.80
adish
-0.76
rador
-0.75
uled
-0.75
Sunder
-0.70
emetery
-0.70
Quit
-0.70
vernment
-0.69
acco
-0.68
RAW
-0.68
POSITIVE LOGITS
believer
0.81
believers
0.80
terday
0.75
rill
0.72
Lutheran
0.69
hood
0.65
emancipation
0.64
stic
0.64
positives
0.64
patriot
0.62
Activations Density 0.022%