INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.08
     degr
    -0.08
     Calidad
    -0.08
    тора
    -0.08
     hut
    -0.08
     Verkehrs
    -0.08
     Klang
    -0.07
    ניה
    -0.07
     Schul
    -0.07
    odeled
    -0.07
    POSITIVE LOGITS
     courage
    0.14
     coragem
    0.14
     bravery
    0.13
     courageous
    0.12
    0.11
    fulness
    0.10
     daring
    0.10
     overcoming
    0.09
     perseverance
    0.09
     brav
    0.09
    Act Density 0.022%

    No Known Activations