INDEX
Explanations
references to someone taking an action or making a request towards another person
references to personal pronouns and their relationships to actions or requests
New Auto-Interp
Negative Logits
dawn
-0.63
Course
-0.62
Firm
-0.61
Nah
-0.59
AMI
-0.59
Sina
-0.59
Seller
-0.59
Tale
-0.58
quickShipAvailable
-0.57
Mandarin
-0.56
POSITIVE LOGITS
hooked
0.90
acquainted
0.77
ghan
0.75
nuts
0.74
agy
0.72
fooled
0.71
addicted
0.70
nuts
0.70
ander
0.69
deported
0.68
Activations Density 0.131%