    Explanations

    proper nouns, specifically names, especially the name "Shah"

    Negative Logits
    Token           Logit
    "inary"         -0.70
    " bunny"        -0.70
    " omn"          -0.69
    "lear"          -0.67
    " auxiliary"    -0.66
    " gradient"     -0.65
    " sentient"     -0.64
    "VILLE"         -0.62
    " fiber"        -0.62
    "boro"          -0.61
    Positive Logits
    Token           Logit
    " Shah"         4.00
    " Sharif"       1.68
    " Hussain"      1.59
    " Khan"         1.53
    " Zah"          1.48
    " Mohammad"     1.48
    " Sultan"       1.46
    " Shia"         1.41
    " Sheikh"       1.41
    " Ahmad"        1.38
    Activation Density: 0.015%

    No Known Activations