INDEX
    Explanations

    phrases indicating importance or significance

    the repeated use of the word "and" in various contexts

    Negative Logits
    LOCK    -0.72
    YP      -0.71
    Ĥİ      -0.70
    Shut    -0.69
    uta     -0.66
    STE     -0.66
    è»      -0.65
    Cub     -0.64
    lace    -0.63
    Els     -0.62
    Positive Logits
     hence           1.40
     consequently    1.35
     therefore       1.35
     thus            1.23
     secondly        1.13
     furthermore     1.04
     thereby         1.02
     moreover        1.00
     then            0.94
     although        0.93
    Act Density 0.670%

    No Known Activations