INDEX
    Negative Logits
    Token         Logit
    " }>↵"        -0.07
    "ufreq"       -0.07
    "elier"       -0.07
    "_rot"        -0.07
    " <<↵"        -0.07
    "atomic"      -0.07
    " Bare"       -0.06
    " Backend"    -0.06
    "memiş"       -0.06
    "راف"         -0.06
    Positive Logits
    Token         Logit
    " vision"     0.21
    " Vision"     0.19
    "Vision"      0.17
    "vision"      0.12
    " visions"    0.11
    "ision"       0.10
    "VISION"      0.10
    " Mission"    0.08
    " envis"      0.08
    " plans"      0.08
    Act Density 0.006%

    No Known Activations