INDEX
    Explanations
    Negative Logits
     shafts           -0.09
     shaft            -0.08
                      -0.08
     tweets           -0.07
     kwart            -0.07
    (Exception        -0.07
    slider            -0.07
    axes              -0.07
     trembling        -0.07
     reportage        -0.07
    Positive Logits
     cozinha          0.08
                      0.08
     خدا              0.08
    oops              0.08
    .hibernate        0.08
    -ọ                0.07
     stole            0.07
    .springframework  0.07
     значит           0.07
    meaning           0.07
    Act Density 0.073%

    No Known Activations