Explanations
personal pronouns indicating possession
Negative Logits
etheless      -0.80
IVERS         -0.67
andel         -0.58
amily         -0.57
Jacobs        -0.53
cancellation  -0.53
NM            -0.52
mary          -0.52
Physicians    -0.50
Nichols       -0.50
Positive Logits
/        1.29
panic    1.22
/        1.05
Majesty  1.05
or       0.96
/_       0.92
Honour   0.90
/,       0.89
/.       0.89
/#       0.88
Activations Density: 0.229%