INDEX
Explanations
occurrences of specific keywords and phrases related to current events and legislation
New Auto-Interp
Negative Logits
amb
-0.18
umas
-0.14
ob
-0.13
trình
-0.13
Hispan
-0.13
.netty
-0.13
pag
-0.13
à¹Ĥà¸Ĺ
-0.13
erging
-0.13
/msg
-0.13
POSITIVE LOGITS
ppers
0.15
roid
0.14
ieder
0.14
åĩºåĵģ
0.14
riority
0.14
istrovstvÃŃ
0.14
ÙģØ§Ø¹
0.14
Pale
0.14
IID
0.14
ILD
0.13
Activations Density 0.174%