INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     constructed
    -0.07
     bark
    -0.07
     Floating
    -0.06
    cht
    -0.06
     خر
    -0.06
     Set
    -0.06
    (Constructor
    -0.06
     брос
    -0.06
     invariant
    -0.06
     insol
    -0.06
    POSITIVE LOGITS
     Media
    0.19
     media
    0.17
    Media
    0.17
    media
    0.15
     MEDIA
    0.13
    MEDIA
    0.13
    _media
    0.12
    .media
    0.11
    -media
    0.11
    (media
    0.11
    Act Density 0.014%

    No Known Activations