INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Gewer
    -0.08
     veden
    -0.08
    -0.08
    eker
    -0.07
    "=>$
    -0.07
     unan
    -0.07
     andamento
    -0.07
    äller
    -0.07
    /Error
    -0.07
     escolhas
    -0.07
    POSITIVE LOGITS
     rights
    0.08
    -fetch
    0.08
    Allowed
    0.08
    Authorized
    0.08
    Thinking
    0.08
     roam
    0.08
     timeout
    0.08
    fold
    0.07
    ihkan
    0.07
    awasan
    0.07
    Act Density 0.001%

    No Known Activations