INDEX
    Explanations

    instances of the word "like" in various contexts

    New Auto-Interp
    Negative Logits
    eyse
    -0.17
    istrovstvÃŃ
    -0.16
    åł
    -0.15
    atron
    -0.14
    mony
    -0.14
    kem
    -0.14
    ucu
    -0.14
    roles
    -0.14
    ÅĻed
    -0.14
    ÑĢемÑı
    -0.14
    POSITIVE LOGITS
    utta
    0.17
    Ñĥв
    0.15
     Erf
    0.15
    ë°
    0.15
    Äįek
    0.14
     Wrest
    0.14
    vern
    0.14
     Heller
    0.14
     Orr
    0.13
     اط
    0.13
    Act Density 0.040%

    No Known Activations