INDEX
    Explanations

    significant mentions of numerical values or statistics

    New Auto-Interp
    Negative Logits
     January
    -0.16
    readcr
    -0.16
    bye
    -0.14
    áŀ¶
    -0.14
    pak
    -0.14
     February
    -0.14
     July
    -0.14
    PlainText
    -0.14
     à¤ľà¤¨à¤µà¤°
    -0.13
     December
    -0.13
    POSITIVE LOGITS
    201
    0.28
    202
    0.26
    200
    0.23
     
    0.20
    199
    0.18
    000
    0.18
    æĺŁæľŁ
    0.18
    ĥĿ
    0.17
    197
    0.16
    198
    0.15
    Act Density 0.058%

    No Known Activations