INDEX
    Explanations

    phrases related to providing clarification or additional information

    instances of the word "To" followed by explanations or clarifications

    Negative Logits
    " nets"       -0.68
    " Appears"    -0.61
    "dro"         -0.59
    " forg"       -0.58
    " diapers"    -0.57
    "lot"         -0.55
    "calling"     -0.54
    "fires"       -0.54
    " pic"        -0.53
    "urgy"        -0.53
    Positive Logits
    "ilet"            1.45
    " summarize"      1.24
    " illustrate"     1.12
    " reiterate"      1.11
    "ppings"          1.10
    " complicate"     1.10
    "pping"           1.08
    " clarify"        1.06
    " compensate"     1.06
    " commemorate"    1.02
    Activation Density: 0.044%

    No Known Activations