INDEX
Explanations
numeric values in an unusual notation
special characters or symbols and abnormal characters in the text
Negative Logits
NetMessage    -0.89
emouth        -0.88
hitch         -0.83
nect          -0.81
anooga        -0.78
istically     -0.78
fman          -0.77
mercial       -0.76
glers         -0.75
berra         -0.75
Positive Logits
³     0.95
ת     0.92
Į     0.91
Ö¼    0.89
——    0.86
ÙĨ    0.86
à¥    0.84
²     0.83
ा     0.82
ÙĦ    0.81
Activations Density 0.005%