INDEX
Negative Logits
emery
0.44
his
0.41
okat
0.41
],
0.41
sprouts
0.39
>
0.38
a
0.38
sebagainya
0.37
bts
0.37
]].
0.37
POSITIVE LOGITS
(*)
0.59
(*
0.59
*(
0.59
**,
0.58
*,
0.57
(**
0.55
**,
0.53
*)
0.50
\*
0.50
$*$
0.50
Activations Density 0.001%
emery
his
okat
],
sprouts
>
a
sebagainya
bts
]].
(*)
(*
*(
**,
*,
(**
**,
*)
\*
$*$