INDEX
    Explanations

    phrases related to personal experiences or stories

    New Auto-Interp
    Negative Logits
     Politica
    -0.87
     masaj
    -0.86
     fch
    -0.86
     Confe
    -0.84
     gubern
    -0.83
     toscana
    -0.82
     hcm
    -0.82
     Olimpia
    -0.81
     Simult
    -0.81
     quelquefois
    -0.79
    POSITIVE LOGITS
     d
    0.69
    d
    0.69
     gdyby
    0.63
     raczej
    0.59
     jakby
    0.56
    )_/¯
    0.54
     gotta
    0.53
     hadn
    0.53
    xbd
    0.52
    /*
    0.51
    Act Density 0.073%

    No Known Activations