INDEX
Explanations
references to political decisions and their consequences
New Auto-Interp
Negative Logits
rebel
-0.16
esini
-0.15
rolling
-0.15
rado
-0.15
ŀ
-0.14
zung
-0.14
hani
-0.14
borr
-0.14
anan
-0.14
yar
-0.14
POSITIVE LOGITS
RSS
0.28
rss
0.23
communal
0.22
RSS
0.22
Minority
0.21
conversions
0.19
rss
0.19
minorities
0.19
Minor
0.19
BJ
0.19
Activations Density 0.105%