INDEX
    Explanations

    punctuation marks and formatting symbols

    dialogue starting with quotes

    New Auto-Interp
    Negative Logits
     queſta
    -1.06
     autorytatywna
    -1.02
    queryInterface
    -0.94
     snippetHide
    -0.93
     <=",
    -0.93
     يتيمه
    -0.92
    ſſung
    -0.92
     الرياضيه
    -0.90
    <unused68>
    -0.90
    <unused41>
    -0.90
    POSITIVE LOGITS
    2
    0.48
    /*
    0.47
    After
    0.46
    1
    0.45
    3
    0.43
    The
    0.43
    8
    0.41
    You
    0.41
    I
    0.40
     I
    0.40
    Act Density 0.006%

    No Known Activations