INDEX
    Explanations

    words related to upward movement or direction

    occurrences of the word "up."

    New Auto-Interp
    Negative Logits
    âĸ¬âĸ¬
    -0.65
    mia
    -0.59
     Emanuel
    -0.58
    士
    -0.58
    Synopsis
    -0.58
     understatement
    -0.58
     scapego
    -0.57
     keyword
    -0.56
    Closure
    -0.56
    ¯¯¯¯
    -0.55
    POSITIVE LOGITS
    stairs
    1.10
    rights
    1.02
    river
    0.97
    stage
    0.89
     stairs
    0.87
    raised
    0.86
    ris
    0.84
    np
    0.81
    ornia
    0.81
    erd
    0.80
    Act Density 0.077%

    No Known Activations