INDEX
    Explanations

    verbs related to actions or events in the past that involve some form of communication or decision-making

    New Auto-Interp
    Negative Logits
    adra
    -0.76
    enegger
    -0.73
    atever
    -0.71
    stood
    -0.68
    sit
    -0.68
    neath
    -0.64
    acebook
    -0.63
    farious
    -0.62
    omever
    -0.61
    ankind
    -0.60
    POSITIVE LOGITS
    own
    0.65
     aback
    0.61
     ]
    0.61
    monton
    0.59
     >>
    0.58
     =====
    0.57
     ><
    0.56
    âĦ¢:
    0.55
     )]
    0.55
    ream
    0.54
    Act Density 0.155%

    No Known Activations