INDEX
    Explanations

    questions seeking clarification or information

    New Auto-Interp
    Negative Logits
    oke
    -0.17
    ously
    -0.15
     Holt
    -0.14
     weren
    -0.14
    -switch
    -0.13
     åĨĨ
    -0.13
    tran
    -0.13
    ucha
    -0.12
    оÑĤÑĮ
    -0.12
    ADA
    -0.12
    POSITIVE LOGITS
    soever
    0.25
     exactly
    0.24
     shall
    0.24
     actually
    0.23
     exact
    0.19
     Shall
    0.19
     Exactly
    0.18
     do
    0.18
     should
    0.18
     are
    0.18
    Act Density 0.127%

    No Known Activations