INDEX
    NEGATIVE LOGITS
    Token             Logit
    " arrow"          -0.07
    " trajectory"     -0.07
    " FAIL"           -0.07
    "ikel"            -0.07
    "_yaw"            -0.07
    " diagonal"       -0.07
    "asal"            -0.07
                      -0.07
    " vector"         -0.07
    " blade"          -0.07
    POSITIVE LOGITS
    Token             Logit
    " org"            0.15
    "org"             0.14
    "Org"             0.12
    "(org"            0.11
    "ORG"             0.10
    " Org"            0.10
    "_org"            0.09
    "-org"            0.09
    "\torg"           0.08
    "OG"              0.08
    Activation Density: 0.011%

    No Known Activations