INDEX
    NEGATIVE LOGITS
    Token           Logit
    " prune"        0.69
    "ngthen"        0.66
    "}={\"          0.65
    " hàn"          0.65
    "decre"         0.62
    " diminue"      0.62
    " కాదు"         0.62
    "}{("           0.61
    "เล็ก"           0.61
    " vår"          0.60
    POSITIVE LOGITS
    Token               Logit
    " اینکه"            0.73
    " mengenai"         0.70
    " this"             0.68
    " ascertaining"     0.67
    " into"             0.64
    " möglichen"        0.64
    " skateboarding"    0.64
    " comprehensively"  0.63
    " various"          0.63
    " understanding"    0.63
    Activation Density: 0.134%

    No Known Activations