INDEX
Negative Logits
æħĭ
-0.15
brew
-0.15
олоÑĤ
-0.15
ök
-0.15
aan
-0.14
ensis
-0.14
leagues
-0.14
830
-0.14
ot
-0.14
ortex
-0.13
POSITIVE LOGITS
sticks
0.23
winner
0.22
vice
0.19
ths
0.18
crumbs
0.18
orne
0.17
alyzer
0.17
basket
0.17
win
0.16
nut
0.16
Activations Density 0.009%