INDEX
    Explanations

    words indicating comparisons or evaluations of states, conditions, or entities

    New Auto-Interp
    Negative Logits
    overe
    -0.17
     ΣÏħ
    -0.16
    )prepare
    -0.15
    ith
    -0.15
    utron
    -0.15
    ioni
    -0.15
    owell
    -0.14
    ornings
    -0.14
    jal
    -0.14
     gord
    -0.14
    POSITIVE LOGITS
    .GroupLayout
    0.16
     AAA
    0.15
    LAB
    0.14
    edia
    0.14
     Sullivan
    0.14
    minus
    0.14
    CTR
    0.14
    ITED
    0.14
    Donate
    0.14
    entious
    0.14
    Act Density 0.005%

    No Known Activations