INDEX
Explanations
terms related to control and authority
phrases related to power and dominance
Negative Logits
Token         Logit
à¤            -0.68
Recommend     -0.67
Bi            -0.65
TOM           -0.63
Masquerade    -0.63
Intel         -0.62
é¾įå          -0.62
Matt          -0.61
Mich          -0.61
Tale          -0.60
Positive Logits
Token         Logit
vier          0.92
orship        0.83
control       0.82
iveness       0.80
ership        0.80
eering        0.78
freak         0.77
taker         0.74
essee         0.74
rador         0.73
Activations Density: 0.027%