Explanations
prominent individuals and their affiliations in a professional context
Negative Logits
Token      Logit
ông        -0.17
ácil       -0.16
åĽ         -0.16
aits       -0.14
danmark    -0.13
ỹ          -0.13
oner       -0.13
Opens      -0.13
ansk       -0.13
.opend     -0.13
Positive Logits
Token      Logit
earned     0.29
joined     0.28
received   0.25
earned     0.24
grew       0.23
joined     0.23
rejo       0.22
works      0.22
graduated  0.22
began      0.21
Activations Density: 0.103%
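
As a rough illustration of what a density figure like this summarizes, the sketch below computes the fraction of token positions on which a feature is active. It assumes density means "share of sampled tokens with activation above a threshold"; the page itself does not state the definition, and the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires; this definition is not given on the page.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 3 of 2,000 token positions.
acts = [0.0] * 1997 + [1.2, 0.4, 3.1]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.150%
```

Under this reading, a density of 0.103% would mean the feature is active on roughly one token in a thousand.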