INDEX
Negative Logits
ikation
-0.74
Rüyada
-0.73
تقاوى
-0.69
ModelExpression
-0.68
μφωνα
-0.68
gridx
-0.67
findpost
-0.65
bacter
-0.65
MTI
-0.65
serangga
-0.64
POSITIVE LOGITS
opposition
0.84
Bop
0.82
oppos
0.81
opposed
0.81
opposition
0.80
Opposition
0.78
oppose
0.78
Opposition
0.76
Opa
0.75
Poh
0.75
Activations Density 0.007%