INDEX
    Explanations

    phrases indicating preparation or readiness for a task

    New Auto-Interp
    Negative Logits
    çµIJ
    -0.17
    nette
    -0.15
    utations
    -0.13
    cum
    -0.13
    cf
    -0.13
    ox
    -0.13
    ió
    -0.13
    kup
    -0.13
     Kat
    -0.13
    acker
    -0.13
    POSITIVE LOGITS
     next
    0.45
     now
    0.42
    now
    0.37
    next
    0.37
     ÑĤепеÑĢÑĮ
    0.36
     Next
    0.36
     Now
    0.36
    Next
    0.35
     NEXT
    0.35
    Now
    0.33
    Act Density 0.208%

    No Known Activations