    Negative Logits
    Token        Logit
                 -0.07
     glove       -0.06
    #\           -0.06
     міста       -0.06
    Nous         -0.06
    _exclude     -0.06
    ictionary    -0.06
    -ton         -0.06
     κά          -0.06
    От           -0.06
    Positive Logits
    Token        Logit
    .write       0.08
    imers        0.07
    ascii        0.07
    ]:↵↵↵        0.06
    ctrine       0.06
     (:          0.06
     `.          0.06
                 0.06
    (patient     0.06
     widened     0.06
    Activation Density: 0.001%

    No Known Activations