INDEX
    Explanations

    the definite article "the" in various contexts

    New Auto-Interp
    Negative Logits
     but
    -0.33
    <%
    -0.31
    gefahr
    -0.29
     [
    -0.29
     combinations
    -0.28
     možná
    -0.27
     trips
    -0.27
     mat
    -0.27
     o
    -0.27
     it
    -0.26
    POSITIVE LOGITS
     Dieſe
    0.81
    <unused52>
    0.79
     メンテナ
    0.79
    [@BOS@]
    0.79
    <unused14>
    0.79
     erſten
    0.79
     fashiola
    0.79
    <unused41>
    0.79
    <pad>
    0.79
    <unused8>
    0.79
    Act Density 0.014%

    No Known Activations