INDEX
Explanations
the beginning of sentences or paragraphs
New Auto-Interp
Negative Logits
bootstrapcdn
-1.02
kasarigan
-0.90
IFORN
-0.74
########.
-0.73
Anſ
-0.71
Soph
-0.68
ſever
-0.67
Transc
-0.66
PMailer
-0.65
تقاوى
-0.64
POSITIVE LOGITS
متعلقه
0.66
strijd
0.58
<bos>
0.58
Juifs
0.55
↵
0.54
peines
0.51
informací
0.51
</td>
0.51
legyen
0.50
</h4>
0.50
Activations Density 0.036%