INDEX
Explanations
actions related to taking or seizing opportunities or control
New Auto-Interp
Negative Logits
onso
-0.17
IRO
-0.16
uzzle
-0.15
jÃł
-0.15
Ùĥرة
-0.15
uale
-0.15
FP
-0.14
aska
-0.14
activex
-0.14
irable
-0.14
POSITIVE LOGITS
van
0.17
ets
0.15
oca
0.15
artificial
0.14
opy
0.14
posit
0.14
oman
0.14
orie
0.14
unic
0.14
ingu
0.14
Activations Density 0.020%