INDEX
    Explanations

    phrases related to general topics or concepts

    references to generalized concepts or items described as "things."

    Negative Logits
    Token             Logit
    "inav"            -0.61
    "avorite"         -0.60
    " KM"             -0.57
    "cul"             -0.56
    " crest"          -0.55
    "ynski"           -0.54
    "³³³³³³³³"        -0.54
    " commentary"     -0.53
    "ped"             -0.53
    "onz"             -0.52
    Positive Logits
    Token             Logit
    "iverse"          1.36
    " happened"       1.03
    " happening"      1.00
    " happen"         0.88
    "ional"           0.88
    " happens"        0.85
    "Else"            0.83
    "hots"            0.78
    "ies"             0.77
    " happ"           0.75
    Activation Density: 0.063%

    No Known Activations