INDEX
    Explanations

    punctuation marks or periods in text

    New Auto-Interp
    Negative Logits
    edium
    -0.16
    kola
    -0.14
    lus
    -0.14
    inya
    -0.14
    usch
    -0.13
    oke
    -0.13
    efeller
    -0.13
    abad
    -0.13
    ARB
    -0.13
    bourne
    -0.13
    POSITIVE LOGITS
    prec
    0.15
     Lindsay
    0.15
    åİļ
    0.14
    exo
    0.14
     Prec
    0.14
    expo
    0.14
    yu
    0.14
    iku
    0.13
     Ivanka
    0.13
     itir
    0.13
    Act Density 0.031%

    No Known Activations