INDEX
    Explanations

    Instances of the word "further" and its variants, signaling progression or additional information.

    Negative Logits
    Token       Logit
    "run"       -0.16
    "uno"       -0.16
    "rum"       -0.16
    "undo"      -0.16
    "eln"       -0.15
    "ernen"     -0.15
    "vak"       -0.15
    " hiểm"     -0.15
    "uri"       -0.15
    "ified"     -0.15
    Positive Logits
    Token         Logit
    "ance"        0.30
    " ado"        0.26
    "-reaching"   0.24
    "most"        0.23
    "hin"         0.23
    "ing"         0.23
    " than"       0.22
    "-than"       0.22
    "MORE"        0.19
    "-more"       0.19
    Act Density 0.023%

    No Known Activations