INDEX
    Explanations

    instances of the word "to" and its various forms in the context of potential actions or states

    New Auto-Interp
    Negative Logits
    Beam
    -0.19
    Beat
    -0.16
     Beam
    -0.16
     Beat
    -0.15
    ibu
    -0.14
     nám
    -0.13
    kaar
    -0.13
    olumn
    -0.13
    .scalablytyped
    -0.13
    ÑıÑģÑĮ
    -0.13
    POSITIVE LOGITS
     be
    1.18
    be
    0.63
     Be
    0.56
     باشد
    0.51
    	be
    0.48
    Be
    0.47
    _be
    0.45
    .be
    0.45
     seja
    0.43
     бÑĭÑĤÑĮ
    0.41
    Act Density 0.396%

    No Known Activations