INDEX
Explanations
references to individuals' careers and accomplishments in entertainment
New Auto-Interp
Negative Logits
artz
-0.18
yna
-0.17
ëĦ¤ìĿ´íĬ¸
-0.14
oding
-0.14
storybook
-0.14
_VERTEX
-0.14
ẩu
-0.14
Matchers
-0.14
orges
-0.14
ennes
-0.13
POSITIVE LOGITS
television
0.20
roles
0.20
acting
0.19
Acting
0.19
TV
0.19
appearances
0.19
Actors
0.17
credits
0.17
stage
0.17
acted
0.17
Activations Density 0.097%