    Explanations

    phrases indicating causality

    the word "so" used as a connector or transition in sentences

    Negative Logits
    Token            Logit
    "mast"           -0.62
    " Mens"          -0.61
    "ammy"           -0.59
    "inch"           -0.59
    "女"             -0.58
    " Wer"           -0.56
    " Halls"         -0.56
    "kb"             -0.55
    " silhouette"    -0.53
    " Souls"         -0.53

    Positive Logits
    Token            Logit
    "bered"          1.22
    "oner"           1.14
    "othe"           1.14
    "apy"            1.12
    "othes"          0.99
    "aps"            0.92
    "oooo"           0.88
    "arin"           0.87
    "ooo"            0.85
    "oths"           0.82
    Activation Density: 0.067%

    No Known Activations