INDEX
Explanations
reporting speech or stating information
phrases indicating reported speech or quotations
New Auto-Interp
Negative Logits
Lens
-0.75
actly
-0.72
Hung
-0.71
Pont
-0.70
Enlarge
-0.66
pel
-0.65
kay
-0.64
joice
-0.63
à¦
-0.62
ãĥİ
-0.61
POSITIVE LOGITS
moreover
0.93
furthermore
0.91
secondly
0.81
afterward
0.72
afterwards
0.71
:"
0.69
therefore
0.67
bluntly
0.66
imaru
0.66
"[
0.64
Activations Density 0.168%