INDEX
Explanations
punctuation marks that indicate emphasis or separation
New Auto-Interp
Negative Logits
widetilde
-0.75
fran
-0.73
un
-0.70
tps
-0.68
nungs
-0.66
Peters
-0.65
numRows
-0.62
olo
-0.62
chính
-0.62
capa
-0.61
POSITIVE LOGITS
$;
1.33
;;;
1.33
;;;;
1.28
icolon
1.17
_;
1.15
}$;
1.10
;;
1.10
?;
1.09
AndEndTag
1.09
+;
1.08
Activations Density 0.216%