INDEX
Explanations
names of individuals or characters and their related activities or roles
NEGATIVE LOGITS
éĹ                  -0.75
Writer              -0.63
Trave               -0.60
ishable             -0.59
igrants             -0.59
Guide               -0.58
=-=-=-=-=-=-=-=-    -0.58
raved               -0.58
Written             -0.58
olson               -0.57
POSITIVE LOGITS
replaced            0.65
adj                 0.64
incidentally        0.64
ership              0.63
herself             0.62
Democr              0.61
notoriously         0.60
overpower           0.59
,,,,                0.59
receptive           0.58
Activations Density 1.104%