INDEX
Explanations
verbs related to control, influence, or supremacy
terms related to dominance or control over something
New Auto-Interp
Negative Logits
endment
-0.68
spir
-0.66
dra
-0.62
zn
-0.62
pt
-0.62
ensions
-0.61
endi
-0.61
ead
-0.61
defect
-0.61
resso
-0.61
POSITIVE LOGITS
headlines
0.82
dominated
0.74
overshadowed
0.72
dominates
0.65
Ń·
0.64
charge
0.63
quez
0.62
pread
0.62
ICS
0.62
creen
0.62
Activations Density 0.037%