INDEX
Explanations
information related to news, politics, and crime
references to legal or political issues
Negative Logits
Token         Logit
)."           -0.70
arger         -0.67
thumbnails    -0.60
)).           -0.59
.).           -0.57
âķIJâķIJ        -0.57
UNCLASSIFIED  -0.55
lihood        -0.54
CONCLUS       -0.54
doi           -0.53
Positive Logits
Token         Logit
âĢİ           0.60
Posted        0.58
ONDON         0.57
Basics        0.56
umbai         0.56
Belfast       0.56
][            0.55
tymology      0.53
horny         0.52
defenseman    0.51
Activations Density: 0.777%