INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     vision
    -2.02
    vision
    -1.79
     Vision
    -1.66
    Vision
    -1.59
     VISION
    -1.42
     visions
    -1.30
    VISION
    -1.27
    visions
    -1.24
     visión
    -1.14
    chamber
    -1.12
    POSITIVE LOGITS
    ing
    0.92
    ally
    0.87
    ed
    0.77
    ary
    0.77
    ised
    0.74
    al
    0.72
    aire
    0.69
    a
    0.66
    ی
    0.66
    naire
    0.66
    Act Density 0.198%

    No Known Activations