INDEX
Explanations
verbs or nouns that convey a strong sense of challenging established norms or beliefs
terms that indicate defiance or resistance against norms or systems
New Auto-Interp
Negative Logits
praying
-0.72
dressing
-0.69
lug
-0.69
donating
-0.68
booking
-0.66
scra
-0.65
withdrawing
-0.64
parting
-0.64
sweating
-0.64
renting
-0.63
POSITIVE LOGITS
uates
1.17
ifies
1.12
olves
1.08
iates
1.04
icates
1.04
etheless
1.03
tains
1.01
cludes
0.98
ounded
0.97
ould
0.94
Activations Density 0.165%