INDEX
    Explanations

    punctuation

    New Auto-Interp
    Negative Logits
     Rd
    -0.08
    repr
    -0.08
    -0.07
    talk
    -0.07
     Dans
    -0.07
     ele
    -0.07
    Think
    -0.07
     yaitu
    -0.07
    elan
    -0.07
     x
    -0.07
    POSITIVE LOGITS
     выяв
    0.10
     approving
    0.10
     выяс
    0.10
     confirming
    0.09
     feststellen
    0.09
     فإذا
    0.09
     ואם
    0.09
    确认
    0.09
     pinpoint
    0.09
     பார்க்க
    0.09
    Act Density 0.090%

    No Known Activations