INDEX
Explanations
short phrases of reported speech or quotes
phrases that indicate attribution or citation of statements
Negative Logits
ļéĨĴ           -1.00
¬¼             -0.94
displayText    -0.86
»Ĵ             -0.83
ģ«             -0.83
ĻĤ             -0.82
£ı             -0.79
ħĭ             -0.79
etheless       -0.78
ĪĴ             -0.74
Positive Logits
explains       1.67
says           1.62
admits         1.34
explained      1.32
observes       1.30
recalls        1.25
concedes       1.24
warns          1.21
adds           1.20
argues         1.18
Activations Density: 0.135%