INDEX
    Explanations
    Negative Logits
    " Sesso"          -0.08
                      -0.07
    " Jackets"        -0.07
    "sed"             -0.07
    " pot"            -0.07
    " cast"           -0.06
    "Aspect"          -0.06
    " bolt"           -0.06
    " bonding"        -0.06
    " sed"            -0.06
    Positive Logits
    " University"      0.16
    " university"      0.13
    "University"       0.12
    " Univ"            0.11
    " universities"    0.11
    " Universities"    0.10
    "iversity"         0.09
    " UNIVERSITY"      0.09
    "voor"             0.09
    " навч"            0.08
    Activation Density: 0.032%

    No Known Activations