INDEX
Explanations
elements related to openings or initiations, particularly in a procedural or sequential context
Prepositions followed by "the"
prepositions followed by articles
Negative Logits
<unused52>    -0.73
<unused42>    -0.73
<unused17>    -0.72
<unused14>    -0.72
<unused74>    -0.72
<unused79>    -0.72
<unused3>     -0.72
<unused8>     -0.72
[@BOS@]       -0.72
<pad>         -0.72
Positive Logits
the           0.42
}             0.31
}{            0.29
Sow           0.29
Sm            0.28
Sw            0.27
Ста           0.27
Gal           0.27
Augen         0.27
—             0.26
Activations Density 0.548%