INDEX
Explanations
connections between social issues and governmental responses
Negative Logits
这种        -0.16
het         -0.14
urs         -0.14
ZERO        -0.13
ache        -0.12
stanov      -0.12
mpar        -0.12
성을        -0.12
2           -0.12
operated    -0.12
Positive Logits
economics   0.23
optics      0.23
timing      0.22
location    0.21
whether     0.20
money       0.20
who         0.20
perception  0.20
technique   0.20
attitude    0.20
Activations Density 0.663%