INDEX
    Explanations

    elements related to openings or initiations, particularly in a procedural or sequential context

    Prepositions followed by "the"

    prepositions followed by articles

    New Auto-Interp
    Negative Logits
    <unused52>
    -0.73
    <unused42>
    -0.73
    <unused17>
    -0.72
    <unused14>
    -0.72
    <unused74>
    -0.72
    <unused79>
    -0.72
    <unused3>
    -0.72
    <unused8>
    -0.72
    [@BOS@]
    -0.72
    <pad>
    -0.72
    POSITIVE LOGITS
     the
    0.42
     }
    0.31
    }{
    0.29
     Sow
    0.29
     Sm
    0.28
     Sw
    0.27
    Ста
    0.27
     Gal
    0.27
     Augen
    0.27
    0.26
    Act Density 0.548%

    No Known Activations