INDEX
Explanations
terms and phrases related to proof or evidence in a context of argumentation or claims
New Auto-Interp
Negative Logits
ecycle
-0.84
tones
-0.78
sie
-0.71
ascus
-0.69
asms
-0.66
swer
-0.66
ophon
-0.65
idays
-0.64
bid
-0.63
otin
-0.63
POSITIVE LOGITS
thereof
0.89
ially
0.73
alled
0.72
lessly
0.71
proof
0.71
conclusive
0.70
alling
0.68
bias
0.68
rence
0.66
tampering
0.64
Activations Density 0.027%