    Negative Logits
    Token               Logit
    " candidature"      -0.08
    " wirtschaft"       -0.08
    " candidatura"      -0.08
    (token not shown)   -0.08
    " Expand"           -0.08
    " coher"            -0.08
    " 배열"             -0.07
    " finales"          -0.07
    " 후보"             -0.07
    " protr"            -0.07
    Positive Logits
    Token               Logit
    " traditions"       0.10
    (token not shown)   0.09
    (token not shown)   0.09
    " Tradition"        0.09
    "Dialect"           0.09
    " taş"              0.08
    " tradition"        0.08
    " dialect"          0.08
    "Ubuntu"            0.08
    (token not shown)   0.08
    Activation Density: 0.004%

    No Known Activations