INDEX
    Explanations

    references to different types of entities or classifications in a scientific context

    Negative Logits
     Β
    -0.21
     bacteria
    -0.20
    .BufferedReader
    -0.20
     Bishop
    -0.19
     bishop
    -0.19
     β
    -0.19
    .beta
    -0.18
     biology
    -0.18
    (binary
    -0.18
    @brief
    -0.17
    Positive Logits
    unbind
    0.21
     worst
    0.20
    .unbind
    0.18
     Worst
    0.18
     unb
    0.17
     worse
    0.17
     sisters
    0.16
     foreground
    0.15
    wor
    0.15
    inqu
    0.15
    Act Density 0.291%

    No Known Activations