INDEX
Explanations
phrases indicating accountability and responsibility, particularly in relation to law enforcement actions
conjunctions and transitions in sentences
New Auto-Interp
Negative Logits
SourceFile
-0.87
ULAR
-0.80
ãĤ¼ãĤ¦ãĤ¹
-0.76
oodle
-0.68
Widget
-0.66
olves
-0.65
pecially
-0.65
MpServer
-0.65
arah
-0.64
ãĤ©
-0.64
POSITIVE LOGITS
unlike
1.16
despite
1.02
there
0.97
according
0.96
contrary
0.93
although
0.92
owing
0.90
whereas
0.88
none
0.87
insofar
0.87
Activations Density 0.167%