INDEX
    Explanations

    references to scientific notation and figures in experimental data

    New Auto-Interp
    Negative Logits
     freaking
    -0.46
     autorytatywna
    -0.43
    WriteLiteral
    -0.42
    ActivityCompat
    -0.41
     FUCKING
    -0.40
    曖昧さ回避
    -0.40
     ͡°
    -0.40
     fucking
    -0.39
    ########.
    -0.39
     freakin
    -0.38
    POSITIVE LOGITS
     see
    0.79
    see
    0.75
     Fig
    0.67
    Fig
    0.66
     Figs
    0.61
     hereafter
    0.60
    cf
    0.59
     Supplementary
    0.59
    Figs
    0.59
     Appendix
    0.59
    Act Density 1.893%

    No Known Activations