INDEX
    Explanations

    questions phrased as requests

    New Auto-Interp
    Negative Logits
    ","",
    1.47
     ','
    1.37
     ","
    1.35
    ,’’
    1.31
     ”,
    1.27
     ],"
    1.23
     )(
    1.21
    (),"
    1.20
     ’’
    1.20
     );//
    1.19
    POSITIVE LOGITS
    <start_of_image>
    2.21
    </h2>
    1.99
    </h1>
    1.59
    </h3>
    1.33
    </strong>
    1.28
    <strong>
    1.26
    <0x0D>
    1.24
    1.20
    </h4>
    1.14
     $\
    1.06
    Act Density 3.366%

    No Known Activations