INDEX
Explanations
individuals and their affiliations or roles
instances of the word "who" in relation to individuals and their descriptions or roles
New Auto-Interp
Negative Logits
economical
-0.67
misogyn
-0.67
Georg
-0.64
arbitrary
-0.64
urg
-0.63
destruct
-0.63
³³³³
-0.62
logical
-0.62
Failure
-0.61
Nut
-0.59
POSITIVE LOGITS
oversaw
1.20
oversees
1.16
participated
1.07
attended
1.06
owns
1.05
attends
1.02
specializes
1.02
specialize
1.01
authored
0.96
chaired
0.95
Activations Density 0.100%