    Explanations

    specific proper nouns, particularly names and locations associated with scientific or academic contexts

    NEGATIVE LOGITS
    <bos>           -0.58
     deleteById     -0.57
    -------         -0.52
    vscode          -0.50
     Agassi         -0.48
    Luigi           -0.47
    ↵↵↵             -0.46
    -               -0.46
    IContainer      -0.44
    <eos>           -0.44
    POSITIVE LOGITS
    BibitemShut     0.66
    arangay         0.58
    rsiniz          0.56
     fört           0.55
    zionalità       0.55
    rasında         0.54
     CET            0.54
    pulseira        0.54
     Sentry         0.51
    attutto         0.51
    Activation Density: 0.200%

    No Known Activations