INDEX
Explanations
references to males in informal settings, often characterized by a certain casual attitude or behavior
references to the term "dude" in various contexts
Negative Logits
SNP         -0.82
Labrador    -0.76
framework   -0.75
GST         -0.74
confidence  -0.68
ija         -0.67
Malt        -0.65
gow         -0.65
Strong      -0.65
Hilbert     -0.65
Positive Logits
dude        1.60
dudes       1.50
Spy         1.32
nec         1.25
Actor       1.22
obar        1.21
ulent       1.07
VICE        1.00
Shepard     0.88
spawned     0.87
Activations Density: 0.065%