INDEX
    Explanations

    expressions related to moral evaluations and legality

    Negative Logits
    given
    -0.18
    хи
    -0.18
    istrovství
    -0.17
     Given
    -0.16
     given
    -0.16
     Considering
    -0.16
     ><?
    -0.16
    eldom
    -0.16
    VF
    -0.15
     considering
    -0.15
    Positive Logits
     because
    0.38
     BE
    0.36
     precisely
    0.33
    because
    0.31
     Because
    0.31
     porque
    0.31
    prec
    0.31
    Because
    0.30
     потому
    0.29
     Prec
    0.28
    Act Density 0.147%

    No Known Activations