INDEX
Explanations
pronouns or noun phrases denoting a group of people
pronouns indicating individuals or groups
New Auto-Interp
Negative Logits
Peak
-0.72
tains
-0.64
Applications
-0.61
immunity
-0.60
pires
-0.60
Outside
-0.58
Gore
-0.56
premature
-0.56
Fail
-0.56
Unlimited
-0.55
POSITIVE LOGITS
'll
1.00
're
0.96
've
0.92
ngth
0.86
'd
0.82
bsite
0.81
ald
0.79
ggy
0.78
bart
0.76
'm
0.75
Activations Density 0.287%