    Explanations

    instances of the word "when" signaling temporal references or conditions

    Negative Logits
    "them"      -0.19
    "ise"       -0.17
    "orsi"      -0.16
    "æģµ"       -0.16
    "unately"   -0.15
    "ucs"       -0.15
    "ly"        -0.14
    "ssi"       -0.14
    "elerik"    -0.14
    " olarak"   -0.14

    Positive Logits
    "/if"        0.45
    "soever"     0.43
    " they"      0.33
    "EVER"       0.32
    " we"        0.30
    " faced"     0.29
    " asked"     0.28
    "-либо"      0.27
    "/how"       0.26
    " it"        0.26
    Activation Density: 0.137%

    No Known Activations