INDEX
Explanations
pronouns or possessive determiners used to refer to a specific person or group
possessive pronouns and expressions of ownership or reference
Negative Logits
lished    -0.83
-+-+      -0.82
tu        -0.79
alde      -0.78
pps       -0.74
Lago      -0.73
ellen     -0.72
oak       -0.71
llah      -0.70
nell      -0.69
Positive Logits
cues          1.31
cue           1.28
chances       1.06
rightful      1.06
oath          0.95
plunge        0.93
frustrations  0.91
own           0.89
bearings      0.88
pulse         0.87
Activations Density: 0.034%