Explanations
words related to political or military control
Negative Logits
Token       Logit
enegger     -0.65
Tale        -0.65
Recommend   -0.63
Honour      -0.63
asus        -0.62
idered      -0.61
uni         -0.60
Hitman      -0.60
neys        -0.60
Bi          -0.59
Positive Logits
Token       Logit
eering      1.00
orship      0.96
ership      0.96
levers      0.87
ANCE        0.85
lessness    0.82
control     0.82
overs       0.78
ability     0.77
thereof     0.76
Activations Density 0.047%