INDEX
Explanations
references to specific organizations, names, or places involved in political or developmental contexts
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.05
3:0.16
4:0.02
5:0.07
6:0.01
7:0.04
8:0.03
9:0.01
10:0.48
11:0.03
Negative Logits
same
-2.21
sockets
-2.18
expecting
-2.11
mere
-2.06
Canaver
-2.05
contrary
-1.94
exclusive
-1.94
usual
-1.93
invitation
-1.93
hibited
-1.89
POSITIVE LOGITS
regain
3.16
overcome
2.98
improve
2.93
navigate
2.65
achieve
2.59
solve
2.56
recover
2.42
clarify
2.38
Improve
2.37
uncover
2.31
Activations Density 0.225%