INDEX
Explanations
phrases related to authority and control
elements related to addiction and dependency issues
New Auto-Interp
Negative Logits
qua
-0.72
Peaks
-0.68
paralle
-0.67
qa
-0.63
Press
-0.61
ebus
-0.60
echo
-0.58
Podcast
-0.58
framework
-0.58
Tyrann
-0.57
POSITIVE LOGITS
themselves
1.72
their
1.22
THEIR
1.08
selves
1.04
their
1.03
careers
1.01
voluntarily
0.98
utterstock
0.92
selves
0.89
Their
0.88
Activations Density 0.840%