INDEX
    Explanations

    phrases related to society, responsibility, and duty

    New Auto-Interp
    Negative Logits
     ftu
    -1.36
     haer
    -1.29
     lele
    -1.28
     ftate
    -1.27
     ftre
    -1.27
     paff
    -1.23
     ufe
    -1.23
     magis
    -1.22
     fta
    -1.22
     vns
    -1.19
    POSITIVE LOGITS
     therefore
    0.92
     hence
    0.81
     this
    0.79
     thats
    0.79
     consequently
    0.76
     thus
    0.75
     it
    0.75
     if
    0.74
     yet
    0.74
     that
    0.73
    Act Density 0.246%

    No Known Activations