INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Questionnaire
    -0.08
     скоро
    -0.08
     creativo
    -0.08
     Learned
    -0.08
     Creat
    -0.08
     vlast
    -0.08
     Tmp
    -0.08
    ateur
    -0.08
     ylabel
    -0.08
     workmanship
    -0.08
    POSITIVE LOGITS
     commercial
    0.09
     platforms
    0.09
    -scale
    0.08
     kakhulu
    0.08
    0.08
    -tier
    0.08
     ביותר
    0.08
     plataformas
    0.08
     крупных
    0.08
    ‌تر
    0.08
    Act Density 0.079%

    No Known Activations