INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    factory
    -0.07
     nursing
    -0.07
    .Context
    -0.06
     staged
    -0.06
     lighting
    -0.06
     society
    -0.06
     budding
    -0.06
    Scaling
    -0.06
    .setting
    -0.06
     Nursing
    -0.06
    POSITIVE LOGITS
     چیز
    0.07
     сох
    0.06
    Carlos
    0.06
     Warn
    0.06
    0.06
    0.06
     insist
    0.06
    eco
    0.06
    Jan
    0.06
     attravers
    0.06
    Act Density 0.007%

    No Known Activations