INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     minimizing
    -0.08
     distances
    -0.07
     decorated
    -0.07
    gger
    -0.07
    +'\
    -0.07
    eters
    -0.07
    'n
    -0.07
    kın
    -0.07
    okie
    -0.07
     phức
    -0.07
    POSITIVE LOGITS
     epid
    0.35
     Epid
    0.18
    0.07
     descend
    0.07
     epidemic
    0.06
     predic
    0.06
    Pid
    0.06
    _epi
    0.06
     MODE
    0.06
    pid
    0.06
    Act Density 0.001%

    No Known Activations