INDEX
    Explanations

    instances of action words

    instances of the phrase "to do" indicating actions or tasks

    New Auto-Interp
    Negative Logits
    lights
    -0.80
    mare
    -0.71
    ussen
    -0.67
    pa
    -0.65
    wagen
    -0.64
    Reviewer
    -0.62
    tight
    -0.62
     Handling
    -0.62
    ware
    -0.61
    bane
    -0.59
    POSITIVE LOGITS
    omsday
    0.99
    pez
    0.98
    omething
    0.91
    ppel
    0.85
    lez
    0.78
    ggy
    0.78
    oms
    0.77
    ozy
    0.77
    lyak
    0.76
     something
    0.75
    Act Density 0.102%

    No Known Activations