INDEX
Explanations
verbs or phrases indicating a sequence of actions or steps
phrases relating to actions taken by the subject
Negative Logits
Token           Logit
Cosponsors      -0.85
Nanto           -0.73
Volunteers      -0.72
Presbyter       -0.69
Pis             -0.68
pse             -0.66
contrasts       -0.65
Christy         -0.65
ccording        -0.65
Preservation    -0.64
Positive Logits
Token           Logit
uberty          0.96
rosis           0.84
rehend          0.83
irtual          0.81
haul            0.81
ihad            0.80
hran            0.80
acqu            0.79
pection         0.77
verified        0.77
Activations Density: 0.349%