INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     zoom
    -1.21
     Zoom
    -1.16
     ZOOM
    -1.07
    zoom
    -1.05
    Zoom
    -1.05
     zooming
    -0.99
     zoomed
    -0.91
    ZOOM
    -0.82
    ViewImports
    -0.73
    #+#
    -0.63
    POSITIVE LOGITS
    et
    0.69
    ito
    0.56
    ed
    0.51
    ir
    0.50
    ot
    0.50
    الحياه
    0.48
    ie
    0.46
    eta
    0.46
    out
    0.46
    tront
    0.46
    Act Density 0.010%

    No Known Activations