INDEX
Negative Logits
Scor
-0.08
area
-0.07
496
-0.07
call
-0.07
umes
-0.07
Gaul
-0.07
oma
-0.06
sexual
-0.06
nấu
-0.06
Mostly
-0.06
POSITIVE LOGITS
independent
0.13
Independent
0.12
Independence
0.11
independence
0.11
Independent
0.11
independ
0.10
independents
0.10
독
0.10
Independ
0.09
independently
0.09
Activations Density 0.015%