INDEX
    Explanations

    instances of the word "now" and its variations

    New Auto-Interp
    Negative Logits
     then
    -0.21
     otherwise
    -0.17
     ÑĤогда
    -0.16
    unate
    -0.16
    then
    -0.16
    still
    -0.16
     still
    -0.15
     poi
    -0.15
    	then
    -0.15
     entonces
    -0.15
    POSITIVE LOGITS
    adays
    0.22
    here
    0.20
    withstanding
    0.17
    HERE
    0.16
    ä¹İ
    0.15
    etten
    0.15
    zsche
    0.14
    theless
    0.14
    aring
    0.14
    uess
    0.14
    Act Density 0.028%

    No Known Activations