INDEX
Negative Logits
respectively
-0.31
$.
-0.29
]).
-0.28
Footnote
-0.28
Ire
-0.27
SPONSORED
-0.26
âĵĺ
-0.26
Interstitial
-0.26
senal
-0.25
mosqu
-0.24
POSITIVE LOGITS
:=
0.23
reads
0.21
!:
0.20
own
0.19
acre
0.19
Fail
0.19
ren
0.18
START
0.18
bah
0.18
nutshell
0.18
Activations Density 3.278%