Explanations
questions or statements about the identity or actions of individuals
instances of the phrase "who's" as well as related questions about identity and responsibility
Negative Logits
avascript    -0.79
nothing      -0.71
tesque       -0.70
earchers     -0.67
olson        -0.65
APD          -0.65
urances      -0.63
Heights      -0.63
iosity       -0.62
obi          -0.62
Positive Logits
responsible  1.01
footing      0.97
whom         0.95
hardest      0.88
closest      0.87
benefiting   0.86
accountable  0.84
responsible  0.83
fault        0.81
sponsoring   0.81
Activations Density: 0.156%
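
The density figure above can be read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only; it assumes density means "fraction of tokens with activation above zero", and the function name `activation_density`, the `threshold` parameter, and the toy `sample` list are hypothetical, not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is
    strictly greater than the threshold (zero by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 3 of the 8 tokens have a nonzero activation.
sample = [0.0, 0.0, 1.2, 0.0, 0.4, 0.0, 0.0, 2.1]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 37.500%
```

Under this reading, a density of 0.156% would correspond to roughly 1 or 2 active tokens per 1,000 sampled.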