INDEX
    Explanations

    instances of the word "that" indicating relevance or specification

    the occurrence of empty or incomplete text segments

    Negative Logits
    Token       Logit
    "EMBER"     -0.52
    "uty"       -0.49
    "anton"     -0.46
    "polit"     -0.43
    "eely"      -0.43
    " Corps"    -0.43
    "omet"      -0.42
    "bern"      -0.41
    "roth"      -0.41
    "ucc"       -0.40
    Positive Logits
    Token            Logit
    " doesnt"        0.83
    " translates"    0.81
    " consists"      0.78
    " lasts"         0.78
    " includes"      0.77
    " involves"      0.77
    " utilizes"      0.76
    " lasted"        0.76
    " resembles"     0.75
    " sucks"         0.75
    Act Density 0.073%

    No Known Activations