Explanations
numbers within texts
phrases or sentences that indicate questions or inquiries
Negative Logits
wagen               -0.84
creen               -0.74
oulos               -0.71
Canaver             -0.66
nesday              -0.66
unchecked           -0.66
Asheville           -0.66
guiActiveUnfocused  -0.65
Mellon              -0.65
general             -0.64
Positive Logits
________________________________________________________________  1.08
~~~~~~~~~~~~~~~~                                                  0.97
É                                                                 0.94
------------------------                                          0.93
Ëľ                                                                0.93
---------------                                                   0.92
********                                                          0.92
****************                                                  0.92
~~~~~~~~                                                          0.91
ãĥĪ                                                               0.91
Activations Density 0.005%