INDEX
Explanations
statements with uncertainty or speculation
phrases related to evidence and claims in discussions
New Auto-Interp
Negative Logits
inav
-0.65
pione
-0.64
ãĥĩãĤ£
-0.62
arse
-0.62
ãĤ¨ãĥ«
-0.62
subur
-0.61
ãĤ´ãĥ³
-0.60
ãĤ¼ãĤ¦ãĤ¹
-0.57
ãĥĵ
-0.57
è»
-0.55
POSITIVE LOGITS
it
0.57
Americans
0.56
anyone
0.54
individuals
0.54
Britons
0.54
tensions
0.53
improper
0.53
landlords
0.52
people
0.51
certain
0.51
Activations Density 0.912%