INDEX
Explanations
statements asserting the truth or falsity of mathematical or logical propositions
Confirming truth of statements
Negative Logits
GIVEREF       -0.56
IVEREF        -0.54
snippetHide   -0.54
Италијани     -0.52
uxxxx         -0.51
AndEndTag     -0.51
ंदीखरीदारी     -0.51
怎麼辦         -0.48
Gebet         -0.48
Bewußt        -0.48
Positive Logits
false         0.65
False         0.64
false         0.59
FALSE         0.57
False         0.54
FALSE         0.54
True          0.49
true          0.47
statements    0.45
untrue        0.45
Activations Density 0.067%