INDEX
Explanations
phrases or sentences enclosed by quotation marks
quoted speech or dialogue
New Auto-Interp
Negative Logits
¾
-0.88
ĻĤ
-0.85
İĭ
-0.81
Ͻ
-0.80
¸
-0.80
²¾
-0.79
paren
-0.73
emis
-0.70
¥µ
-0.69
kell
-0.68
POSITIVE LOGITS
/"
0.86
sic
0.73
OTUS
0.71
["
0.67
Adds
0.65
remark
0.64
Donnell
0.63
justifies
0.62
Brien
0.62
into
0.60
Activations Density 0.100%