INDEX
Explanations
instances of claims, assertions, and denials
New Auto-Interp
Negative Logits
á»ĩu
-0.15
oret
-0.15
appen
-0.15
aná
-0.15
alent
-0.15
gam
-0.14
.expect
-0.14
.innerHeight
-0.14
nj
-0.14
udden
-0.14
POSITIVE LOGITS
otherwise
0.32
there
0.29
they
0.27
otherwise
0.26
Otherwise
0.25
differently
0.24
it
0.24
OTHERWISE
0.22
we
0.21
Otherwise
0.20
Activations Density 0.201%