INDEX
    Explanations

    personal pronouns related to oneself

    Negative Logits
    Token                Logit
    "<bos>"              -2.88
    " springfox"         -0.77
    " guma"              -0.69
    "EndProject"         -0.68
    "lateinit"           -0.66
    (whitespace only)    -0.66
    " jakarta"           -0.65
    " säkert"            -0.62
    "/**"                -0.61
    "</tbody>"           -0.58
    Positive Logits
    Token                Logit
    " disreg"             1.23
    " unlaw"              1.18
    " malheure"           1.16
    " habile"             1.15
    " véhic"              1.15
    " shenan"             1.15
    " héro"               1.10
    " effray"             1.08
    " accla"              1.07
    " expéri"             1.05
    Activation Density: 0.117%

    No Known Activations