INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    domada
    -0.38
     میز
    -0.35
     Krieg
    -0.34
    Weiß
    -0.34
     smeared
    -0.33
     Weiß
    -0.33
    GetType
    -0.33
    CHARS
    -0.33
    rieta
    -0.33
    liches
    -0.32
    POSITIVE LOGITS
     shopping
    1.29
     Shopping
    1.23
    shopping
    1.18
    Shopping
    1.17
     SHOPPING
    1.16
     shopper
    1.03
     shop
    1.00
     SHOP
    0.98
     shops
    0.98
    Shop
    0.97
    Act Density 0.012%

    No Known Activations