INDEX
Negative Logits
Wax
-0.08
bijzondere
-0.08
bygg
-0.08
wax
-0.08
글
-0.08
fita
-0.07
unusual
-0.07
WTF
-0.07
christmas
-0.07
Tucson
-0.07
POSITIVE LOGITS
まして
0.08
ENCY
0.08
tofu
0.08
cyclists
0.08
.Fore
0.07
joined
0.07
cyclist
0.07
stück
0.07
closures
0.07
Chimp
0.07
Activations Density 0.008%