INDEX
    Explanations
    Negative Logits
    Token             Logit
    " bourgeoisie"    -0.06
    " Integr"         -0.06
    " influenza"      -0.06
    " облі"           -0.06
    " Aj"             -0.06
    " PLATFORM"       -0.06
    " gee"            -0.06
    ".Microsoft"      -0.06
    "iators"          -0.06
    " правил"         -0.06

    Positive Logits
    Token             Logit
    " dogs"            0.07
    " pets"            0.07
    " Dresden"         0.07
    " kıs"             0.07
    " retreat"         0.07
    "zan"              0.06
    "EY"               0.06
    " diag"            0.06
    " chant"           0.06
    " Como"            0.06
    Activation Density: 0.015%

    No Known Activations