INDEX
Explanations
mentions of criminal acts and investigations
elements related to crime and investigations
New Auto-Interp
Negative Logits
)."
-0.68
).[
-0.64
."[
-0.62
".[
-0.61
.""
-0.60
.'"
-0.59
.).
-0.57
'."
-0.57
]."
-0.56
cffffcc
-0.56
POSITIVE LOGITS
minist
0.50
?:
0.50
?",
0.47
esides
0.46
?
0.45
'?
0.44
ãĤ¼ãĤ¦ãĤ¹
0.42
Grimoire
0.42
!?
0.41
efe
0.40
Activations Density 2.428%