INDEX
    Explanations

    words related to specific ways of doing things or actions

    phrases that describe modes or styles of behavior

    New Auto-Interp
    Negative Logits
     Pett
    -0.65
     Klux
    -0.62
    oret
    -0.61
    agra
    -0.61
    ophy
    -0.59
    icent
    -0.57
    onge
    -0.57
     Sisters
    -0.57
     Prediction
    -0.55
    agged
    -0.55
    POSITIVE LOGITS
     guiActiveUnfocused
    0.79
     whatsoever
    0.78
    manship
    0.76
     ashore
    0.73
     surrounded
    0.72
    places
    0.71
    leon
    0.70
     imaginable
    0.70
    smanship
    0.70
     resembling
    0.68
    Act Density 0.467%

    No Known Activations