INDEX
    Explanations

    numerical values ending with '1' and sections with phrases that involve placing human lives at risk

    Arabic characters and symbols related to text encoding issues

    New Auto-Interp
    Negative Logits
    nect
    -0.97
    NetMessage
    -0.85
    istically
    -0.84
    */(
    -0.83
    onomy
    -0.80
    anooga
    -0.80
    fman
    -0.79
    glers
    -0.77
     hitch
    -0.77
    inately
    -0.76
    POSITIVE LOGITS
    ³
    0.89
    ת
    0.84
    à¥
    0.84
    ÙĦ
    0.82
    Į
    0.80
    ÙĨ
    0.78
    ——
    0.77
    ¸
    0.77
    Ö¼
    0.77
    ´
    0.77
    Act Density 0.009%

    No Known Activations