INDEX
    Explanations

    terms related to movement or progress

    instances of the word "go" in various contexts

    New Auto-Interp
    Negative Logits
    lihood
    -0.75
    lished
    -0.73
     Advertisement
    -0.71
    alia
    -0.69
    stakes
    -0.66
     Responsibility
    -0.64
    expression
    -0.62
    andestine
    -0.61
    vich
    -0.60
    majority
    -0.59
    POSITIVE LOGITS
     go
    2.97
    Go
    1.83
    go
    1.81
     Go
    1.75
     goes
    1.61
     GO
    1.56
     proceed
    1.42
     went
    1.38
     gone
    1.36
     stay
    1.25
    Act Density 0.045%

    No Known Activations