INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    (element
    -0.08
     elemento
    -0.08
     товара
    -0.08
     گرفتن
    -0.08
    (target
    -0.08
     στοι
    -0.08
     incum
    -0.08
     عناصر
    -0.08
     shirts
    -0.08
    (elem
    -0.07
    POSITIVE LOGITS
     cérebro
    0.14
     cerveau
    0.14
     brains
    0.12
     мозга
    0.11
     neurons
    0.11
     brain
    0.11
     cortex
    0.10
     neuroscience
    0.10
     brian
    0.10
     wiring
    0.10
    Act Density 0.019%

    No Known Activations