INDEX
Explanations
mentions of specific roles or positions held by individuals
instances of the article "a" followed by numerical expressions or descriptors of roles, ages, or experiences
Negative Logits
ðĿ            -0.76
tons          -0.74
imates        -0.73
eworks        -0.71
iths          -0.69
encies        -0.67
NetMessage    -0.67
ayers         -0.66
ency          -0.66
ivable        -0.65
Positive Logits
teenager      1.34
youngster     1.02
kid           1.02
child         1.01
spectator     0.99
prisoner      0.97
clerk         0.96
waiter        0.96
consultant    0.95
youth         0.95
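The logit lists above are commonly produced by projecting a feature's output direction onto the model's unembedding vectors and ranking tokens by the resulting score. The page does not state how its values were computed, so the following is only a minimal sketch under that assumption. The function `rank_tokens_by_logit`, the two-dimensional vectors, and all numbers in `toy_unembed` are hypothetical and are not taken from this feature.

```python
def rank_tokens_by_logit(direction, unembed, k):
    """Return (top_positive, top_negative) token lists of length k.

    direction: list of floats, the feature's output direction (assumed).
    unembed:   dict mapping token -> unembedding vector (assumed layout).
    """
    # Score each token by the dot product of its vector with the direction.
    scores = {
        token: sum(d * u for d, u in zip(direction, vec))
        for token, vec in unembed.items()
    }
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    top_positive = ordered[:k]                 # most promoted tokens
    top_negative = ordered[::-1][:k]           # most suppressed tokens
    return top_positive, top_negative


# Hypothetical toy data for illustration only.
toy_direction = [1.0, 0.0]
toy_unembed = {
    "teenager": [1.3, 0.2],
    "clerk": [0.9, -0.1],
    "tons": [-0.7, 0.5],
}

positive, negative = rank_tokens_by_logit(toy_direction, toy_unembed, k=2)
for token, score in positive:
    print(f"{token:<12}{score:+.2f}")
```

Under this reading, a positive list dominated by role and age nouns is consistent with the explanations given for the feature: after "a", the feature pushes the model toward words such as "teenager" or "clerk".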
Activations Density 0.106%
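Activation density is usually read as the fraction of token positions on which the feature fires at all. A minimal sketch of that calculation follows, assuming a feature counts as active when its activation is above zero. The function `activation_density`, the threshold, and the sample data are hypothetical, chosen only so that the result matches the 0.106% shown here.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds the threshold."""
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 token positions, the feature fires on 106.
sample = [0.0] * 99_894 + [1.5] * 106

density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low means the feature is sparse: it is silent on the vast majority of tokens and activates only in a narrow set of contexts.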