INDEX
    Explanations

    occurrences of the word "out" and its context within phrases

    NEGATIVE LOGITS
    ه‌اند    -0.54
     FBref    -0.45
    seau    -0.42
    niów    -0.42
    ientras    -0.42
     noDo    -0.41
     către    -0.40
     kön    -0.40
     périph    -0.40
    ktır    -0.40
    POSITIVE LOGITS
     emerged    0.66
     emerge    0.66
     emerges    0.59
     emerging    0.58
     out    0.53
    ImageContext    0.53
     emergence    0.51
     emergent    0.51
     COME    0.51
     come    0.50
    Act Density 0.007%
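
    To give a sense of scale for the density figure, the sketch below converts it into an expected token count. It assumes that "Act Density" means the fraction of tokens on which the feature is active, which this page does not state; the function name `expected_active_tokens` and the one-million-token corpus size are hypothetical and chosen only for illustration.

```python
def expected_active_tokens(density_percent: float, n_tokens: int) -> float:
    """Expected number of tokens on which the feature is active.

    Assumption: density_percent is the percentage of tokens with a
    non-zero activation, as the "Act Density" label suggests.
    """
    return density_percent / 100.0 * n_tokens


# Hypothetical corpus size, used only to illustrate the arithmetic.
per_million = expected_active_tokens(0.007, 1_000_000)
print(round(per_million))  # roughly 70 active tokens per million
```

    Under that assumption the feature fires on about 70 of every million tokens, so it is very sparse.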

    No Known Activations