INDEX
    Explanations

    Italian and Spanish words and names

    New Auto-Interp
    Negative Logits
    etheless
    -0.77
    INGTON
    -0.76
    interrupted
    -0.70
    oola
    -0.69
    iaries
    -0.69
    orage
    -0.67
    hips
    -0.67
    ourcing
    -0.64
    otos
    -0.64
    abies
    -0.64
    POSITIVE LOGITS
    lla
    1.51
    ller
    1.42
    llers
    1.42
    lli
    1.39
    lda
    1.37
    gger
    1.31
    zza
    1.31
    xt
    1.31
    lling
    1.31
    ppo
    1.26
    Act Density 4.423%

    No Known Activations