INDEX
    Explanations

    humility, learning, and risks

    New Auto-Interp
    Negative Logits
    :NS
    -0.09
     pseudo
    -0.08
     ứng
    -0.08
     carrying
    -0.08
     NS
    -0.08
    运输
    -0.08
     feu
    -0.08
     background
    -0.08
     transportar
    -0.07
     Background
    -0.07
    POSITIVE LOGITS
     humility
    0.12
     admitir
    0.11
     admitting
    0.10
     acknowledges
    0.10
     recept
    0.09
     receptive
    0.09
     admits
    0.09
     acknowledging
    0.09
    geschlossen
    0.08
     openness
    0.08
    Act Density 0.045%

    No Known Activations