INDEX
    Explanations

    words related to legal or bureaucratic processes

    Negative Logits
     bourgeo    -1.31
     sappi      -1.31
     applau     -1.26
     incess     -1.26
     igno       -1.17
     nutr       -1.13
     emphat     -1.12
     ordina     -1.12
    ;;)         -1.11
     simplif    -1.10
    Positive Logits
     there      0.78
     although   0.73
     it         0.70
     while      0.69
     we         0.66
     they       0.64
     the        0.61
     if         0.61
     despite    0.60
     since      0.60
    Act Density 0.245%

    No Known Activations