Explanations
phrases where someone is quoted speaking
instances of the pronoun "he" indicating a speaker or subject in discourse
Negative Logits
lihood          -0.78
privile         -0.71
miscarriage     -0.66
Measure         -0.65
selection       -0.62
disproportion   -0.60
bookmark        -0.60
caliber         -0.59
ãĥij             -0.57
locating        -0.56
Positive Logits
said            1.32
told            1.30
joked           1.18
wrote           1.17
explained       1.16
says            1.13
remarked        1.10
said            1.06
exclaimed       1.05
tweeted         1.04
Activations Density 0.056%