INDEX
    Explanations

    the word "almost" and its variations

    New Auto-Interp
    Negative Logits
    ะ
    -0.16
     пÑĥ
    -0.15
    orem
    -0.15
    rowsable
    -0.15
    виÑĩ
    -0.15
    utors
    -0.15
    оÑĢа
    -0.15
    oren
    -0.14
    ç¢
    -0.14
    Ïģε
    -0.14
    POSITIVE LOGITS
    ness
    0.17
    QUIRES
    0.16
    arda
    0.16
    arial
    0.16
    اÙģÙĩ
    0.15
    mente
    0.14
    Segoe
    0.14
    agher
    0.14
    ive
    0.14
    s
    0.14
    Act Density 0.041%

    No Known Activations