INDEX
Negative Logits
straw
-0.16
Straw
-0.15
ahun
-0.14
Everett
-0.14
arching
-0.14
zier
-0.14
Attrib
-0.14
omn
-0.13
eyer
-0.13
memberof
-0.13
POSITIVE LOGITS
umu
0.18
WithOptions
0.16
upo
0.16
enberg
0.15
]|[
0.14
uyá»ģn
0.14
merits
0.14
Gale
0.13
Island
0.13
ine
0.13
Activations Density 0.002%