INDEX
Explanations
references to fear of arrest or restraint
concerns related to fear and legal issues
Negative Logits
Originally    -0.50
nutshell      -0.49
âĢº           -0.49
pires         -0.48
refers        -0.45
inar          -0.43
xtap          -0.43
onnaissance   -0.43
·             -0.42
Deadline      -0.42
POSITIVE LOGITS
'."    0.93
)).    0.92
]."    0.87
.'"    0.85
.).    0.83
)."    0.80
'.     0.80
!".    0.77
%.     0.77
".     0.77
Activations Density 3.911%