INDEX
Explanations
proper nouns representing a specific person
references to a specific male individual in various contexts
Negative Logits
iculty       -0.63
DAY          -0.61
Interest     -0.58
affiliate    -0.58
reshold      -0.56
com          -0.56
terms        -0.56
Articles     -0.56
Jindal       -0.55
requisite    -0.55
Positive Logits
zbollah      1.21
resy         1.04
'll          1.03
reditary     1.03
Majesty      0.96
uristic      0.95
gemony       0.93
pherd        0.92
eded         0.91
aven         0.91
Activations Density: 0.282%