INDEX
    Explanations

    people describing their background and experiences

    New Auto-Interp
    Negative Logits
    jandro
    -0.55
     malheureux
    -0.50
    stasia
    -0.50
    )>=
    -0.48
    heran
    -0.47
     berken
    -0.47
    eckel
    -0.46
     makam
    -0.46
    ikyuu
    -0.46
    侵略
    -0.46
    POSITIVE LOGITS
     myself
    0.74
    Sebagai
    0.64
    <bos>
    0.62
    GEBURTSDATUM
    0.61
    Setiap
    0.60
    Bukan
    0.59
    álbum
    0.58
    expandindo
    0.56
    фициальный
    0.56
    Sklici
    0.56
    Act Density 0.550%

    No Known Activations