INDEX
Explanations
references to governmental policies and their implications in societal contexts
New Auto-Interp
Negative Logits
ÅĦ
-0.15
ieber
-0.15
å´
-0.15
ari
-0.14
iem
-0.14
ould
-0.14
imeo
-0.14
леменÑĤ
-0.14
OrDefault
-0.14
fter
-0.14
POSITIVE LOGITS
vak
0.17
.da
0.17
anter
0.16
.tie
0.15
ADX
0.14
unal
0.14
ayette
0.14
ç¢
0.14
ilha
0.14
Ying
0.14
Activations Density 1.806%