INDEX
Negative Logits
sWith
-0.30
ski
-0.30
t
-0.28
sk
-0.26
ta
-0.26
tion
-0.26
site
-0.26
sdale
-0.25
td
-0.25
ship
-0.25
POSITIVE LOGITS
presso
0.28
apeake
0.28
earch
0.26
ey
0.25
aurus
0.24
ee
0.23
ellschaft
0.23
ek
0.23
cent
0.22
pecially
0.21
Activations Density 0.087%