INDEX
Explanations
sentences related to legal or criminal activities
occurrences of commas in the text
New Auto-Interp
Negative Logits
");
-0.80
").
-0.77
');
-0.75
tsy
-0.70
"),
-0.60
.).
-0.60
esi
-0.59
Primary
-0.58
>.
-0.57
',"
-0.57
POSITIVE LOGITS
meanwhile
1.28
however
1.17
moreover
1.05
coupled
1.02
along
0.90
along
0.90
plus
0.81
combined
0.78
namely
0.75
unsurprisingly
0.75
Activations Density 0.398%