INDEX
Explanations
information about individuals' professional roles and backgrounds
New Auto-Interp
Negative Logits
sponsoring
-0.15
iffe
-0.15
1
-0.14
Alone
-0.14
sponsorship
-0.14
ÑģиÑĤ
-0.14
Phen
-0.14
[]{↵-0.14
Rounds
-0.14
[]
-0.13
POSITIVE LOGITS
covering
0.23
writing
0.20
Reporting
0.19
covers
0.19
åĨĻ
0.19
reporting
0.18
covers
0.18
covering
0.18
coverage
0.18
æĬ¥éģĵ
0.18
Activations Density 0.116%