INDEX
Explanations
negations and refutations of statements
New Auto-Interp
Negative Logits
Weyl
-0.89
recto
-0.89
يتيمه
-0.88
DockStyle
-0.87
PTC
-0.84
homoto
-0.83
Sopho
-0.81
sorption
-0.81
fidu
-0.81
purpoſe
-0.81
POSITIVE LOGITS
is
1.29
a
1.03
not
0.98
being
0.97
are
0.96
was
0.94
è
0.93
becoming
0.91
also
0.91
isn
0.88
Activations Density 0.118%