    Explanations

    phrases indicating availability or readiness for action

    phrases indicating something is available or being offered

    Negative Logits
    cles            -0.70
    ellen           -0.68
    ACTED           -0.66
    aan             -0.66
    aghan           -0.66
    sequently       -0.65
    een             -0.64
    leigh           -0.64
    MJ              -0.63
    ensen           -0.62

    Positive Logits
     instance       0.83
     example        0.81
    gery            0.79
    going           0.78
    geries          0.78
     grabs          0.78
    gotten          0.71
     emergencies    0.68
    Ĥª              0.68
     starters       0.67
    Activation Density: 0.116%

    No Known Activations