INDEX
Negative Logits
statutory
-0.07
services
-0.07
'
-0.06
_<
-0.06
paid
-0.06
聚
-0.06
soaking
-0.06
Thường
-0.06
いう
-0.06
Sorted
-0.06
POSITIVE LOGITS
—the
0.18
—but
0.17
—and
0.17
—a
0.16
—it
0.16
—in
0.15
—an
0.15
—is
0.15
—with
0.15
—which
0.14
Activations Density 0.009%