INDEX
Explanations
references to written works or significant reports
Text quotations, citations, or references
written statements or quotes
New Auto-Interp
Negative Logits
<eos>
-0.57
']))
-0.51
"));
-0.44
UOUS
-0.42
});
-0.41
ugas
-0.41
"))
-0.41
prech
-0.40
Toolkit
-0.40
WriteTagHelper
-0.39
POSITIVE LOGITS
Quote
1.01
quote
0.91
Цитата
0.89
引用
0.83
Geplaatst
0.83
Quote
0.82
QUOTE
0.80
Wrote
0.79
quoted
0.79
wrote
0.79
Activations Density 0.071%