INDEX
    Explanations

    mathematical expressions involving x

    New Auto-Interp
    Negative Logits
    friend
    0.76
    wow
    0.75
    seeing
    0.71
    これで
    0.71
    Friend
    0.70
    number
    0.69
    gent
    0.67
    thumb
    0.66
    fresh
    0.65
    woman
    0.64
    POSITIVE LOGITS
     маши
    0.83
     pelo
    0.69
     וב
    0.68
     правил
    0.67
     sals
    0.66
     macer
    0.66
     ordinateur
    0.64
     científicos
    0.64
    ра
    0.64
     چلے
    0.64
    Act Density 0.114%

    No Known Activations