INDEX
Explanations
phrases related to critical or negative assessments
phrases indicating long-term relationships or commitments
Negative Logits
charms          -0.86
Clubs           -0.80
domains         -0.80
magazines       -0.76
radios          -0.76
establishments  -0.75
clubs           -0.74
idols           -0.74
launchers       -0.73
embassies       -0.72
Positive Logits
exclusive       1.20
sized           1.20
alone           1.04
intensive       1.04
filled          1.01
powered         1.00
themed          0.99
fashioned       0.97
edged           0.97
centric         0.97
Activations Density 0.209%