INDEX
Negative Logits
للمعارف
-1.03
myſelf
-1.02
raiſ
-1.02
reaſon
-0.97
itſelf
-0.97
Jefus
-0.97
хьтан
-0.96
pleaſure
-0.96
cauſe
-0.95
GenerationType
-0.94
POSITIVE LOGITS
SuccessListener
0.42
(
0.41
.
0.41
ask
0.41
use
0.40
test
0.39
if
0.39
the
0.39
as
0.38
↵
0.38
Activations Density 0.035%