INDEX
Explanations
punctuation marks, particularly periods and question marks
Comes before "By" or punctuation
concluding phrases
Negative Logits
didSet        -0.98
estekak       -0.88
ffilmiau      -0.85
Wiktionnaire  -0.85
providedIn    -0.84
ITERATURE     -0.83
BoxFit        -0.83
ReusableCell  -0.82
Vikipedi      -0.81
ﷺ             -0.80
Positive Logits
↵↵                 0.87
↵                  0.62
ちなみに           0.49
Furthermore        0.48
↵↵↵                0.47
[token not shown]  0.47
Furthermore        0.45
Ultimately         0.43
Ultimately         0.43
[token not shown]  0.42
Activations Density 0.793%