INDEX
Negative Logits
ment
-0.22
ments
-0.21
rou
-0.17
los
-0.17
ust
-0.17
ernaut
-0.17
baz
-0.16
by
-0.16
tings
-0.16
books
-0.15
POSITIVE LOGITS
therapy
0.28
active
0.27
activity
0.25
thon
0.24
frequency
0.24
stations
0.23
station
0.22
actively
0.22
head
0.21
Shack
0.20
Activations Density 0.013%