INDEX
    Explanations

    contractions of the form "verb + 's", typically indicating possession or omission of a letter

    possessive constructions and questions about different subjects or topics

    New Auto-Interp
    Negative Logits
    velop
    -0.70
    oak
    -0.62
    igraph
    -0.62
    onent
    -0.60
    sha
    -0.60
    UME
    -0.59
    erers
    -0.58
    cember
    -0.57
     wards
    -0.57
    onz
    -0.56
    POSITIVE LOGITS
     happened
    0.97
     happening
    0.88
     gonna
    0.87
     transpired
    0.84
    pace
    0.79
     happ
    0.75
     gotta
    0.73
     Done
    0.71
     REALLY
    0.68
     done
    0.68
    Act Density 0.047%

    No Known Activations