INDEX
Negative Logits
weken
0.47
kiş
0.46
gigante
0.45
但我
0.45
vaše
0.44
我想
0.44
ER
0.44
OT
0.43
jumbo
0.43
siècles
0.43
POSITIVE LOGITS
arrangements
0.52
sacks
0.52
replies
0.46
meats
0.46
resolves
0.45
affair
0.45
blasts
0.44
flanks
0.44
laces
0.44
appears
0.43
Activations Density 0.008%