INDEX
Explanations
references to policy-related organizations and their roles
New Auto-Interp
Negative Logits
inu
-0.17
aan
-0.15
mue
-0.15
.IContainer
-0.14
òi
-0.14
omitempty
-0.14
reas
-0.14
ecal
-0.14
jec
-0.13
omo
-0.13
POSITIVE LOGITS
organization
0.24
think
0.24
non
0.23
advocacy
0.22
non
0.21
umbrella
0.20
organization
0.20
adv
0.20
watchdog
0.19
nonprofit
0.19
Activations Density 0.057%