    Explanations

    instances of the word "open" and its variations in different contexts

    Negative Logits
    Token         Logit
    "rone"        -0.14
    "zug"         -0.14
    "ãģĵãĤĵ"      -0.14
    "ROS"         -0.14
    "rames"       -0.14
    " neut"       -0.14
    "thon"        -0.14
    " ROLE"       -0.14
    "enne"        -0.13
    "getti"       -0.13
    Positive Logits
    Token         Logit
    " doors"      0.32
    " Doors"      0.29
    "/open"       0.28
    " opened"     0.26
    "doors"       0.24
    " Pandora"    0.24
    "(open"       0.23
    " gates"      0.22
    "-open"       0.22
    " door"       0.22
    Activation Density: 0.089%

    No Known Activations