INDEX
Explanations
statements asserting the existence or non-existence of something
New Auto-Interp
Negative Logits
AssemblyCulture
-0.83
itſelf
-0.82
SharedCtor
-0.79
Cæsar
-0.74
bewerken
-0.70
Shakspeare
-0.70
themſelves
-0.67
ſtate
-0.67
theſe
-0.65
houſe
-0.64
POSITIVE LOGITS
no
1.32
a
1.24
plenty
1.10
an
1.09
some
1.04
lots
0.97
ample
0.87
nothing
0.85
another
0.83
little
0.76
Activations Density 0.163%