INDEX
Explanations
phrases related to someone having control or possession over something
references to power or control held by specific groups or individuals
New Auto-Interp
Negative Logits
abbrevi
-0.58
doubling
-0.57
apor
-0.56
redo
-0.55
positives
-0.53
trending
-0.52
SEE
-0.52
anned
-0.50
atten
-0.50
confirming
-0.49
POSITIVE LOGITS
of
0.91
OF
0.75
hip
0.75
rils
0.73
lust
0.72
¿
0.71
Of
0.71
adle
0.68
OF
0.67
linger
0.67
Activations Density 0.058%