INDEX
Explanations
mentions of government officials and their titles
Negative Logits
kip        -0.15
Siz        -0.14
渡         -0.14
usage      -0.14
/rss       -0.14
Gle        -0.14
hap        -0.13
oth        -0.13
Inspector  -0.13
ker        -0.13
Positive Logits
jid        0.15
eget       0.15
ieren      0.15
Education  0.14
amilia     0.14
лоп        0.14
REW        0.14
inters     0.13
veter      0.13
quiv       0.13
Activations Density: 0.019%