INDEX
Explanations
references to political decisions and their consequences
Negative Logits
ctp     -0.16
Tube    -0.14
zung    -0.14
amera   -0.14
adius   -0.13
èĴ      -0.13
tube    -0.13
tubes   -0.13
.cg     -0.13
xad     -0.13
Positive Logits
RSS       0.34
Hind      0.32
rss       0.27
RSS       0.25
communal  0.24
AIM       0.24
Ram       0.23
Hindus    0.23
/rss      0.23
hind      0.23
Activations Density: 0.068%