Explanations
sentences that express regret or seek retraction
Follows quotation marks or dialogue opening
offer/request to do something
Negative Logits
"
-0.84
(&
-0.78
&
-0.77
)&
-0.68
("
-0.66
ujednoznacz
-0.66
-0.65
("%
-0.64
"/
-0.63
"#
-0.63
Positive Logits
—”
1.90
-”
1.66
—"
1.52
…”
1.47
——”
1.45
-“
1.34
-"
1.32
—“
1.31
--"
1.31
...”
1.31
Activations Density 0.296%