Explanations
references to the nation's well-being or concerns
Negative Logits
Token        Logit
rud          -0.17
Bethlehem    -0.16
inkle        -0.16
ocha         -0.15
pais         -0.14
IGIN         -0.14
enty         -0.14
los          -0.14
peg          -0.14
iker         -0.14
Positive Logits
Token        Logit
ATAB         0.15
aname        0.14
λεκ          0.14
isel         0.14
esson        0.14
azes         0.14
URA          0.13
оро          0.13
HttpRequest  0.13
setState     0.13
Activations Density 0.039%