INDEX
    Explanations

    phrases indicating a decision or action step being taken

    the word "so" used to indicate purpose or consequence

    New Auto-Interp
    Negative Logits
     glances
    -0.62
    ãģ£
    -0.57
    wreck
    -0.57
     silhouette
    -0.57
     Mens
    -0.56
     disbelief
    -0.56
     gallery
    -0.56
     denial
    -0.55
     realities
    -0.55
     outline
    -0.54
    POSITIVE LOGITS
    bered
    1.24
    othes
    1.13
    apy
    1.10
    oths
    1.08
    othe
    1.07
    aps
    0.99
    oner
    0.98
    oooo
    0.90
    arer
    0.90
    arin
    0.87
    Act Density 0.092%

    No Known Activations