    Explanations

    words prefixed with 'un'

    words and phrases that start with "un-"

    Negative Logits
     Ajax         -0.70
     Cree         -0.66
     FAR          -0.66
     face         -0.65
     Tut          -0.65
     realism      -0.65
     hinge        -0.65
     rides        -0.63
     fixtures     -0.63
     drawer       -0.62
    Positive Logits
    assuming      1.37
    cles          1.37
    apolog        1.34
    ruly          1.34
    confirmed     1.32
    spoken        1.30
    numbered      1.29
    character     1.27
    anticipated   1.27
    occupied      1.27
    Activation Density: 0.026%

    No Known Activations