INDEX
    Explanations

    entities like names, titles or organizations in a specific format

    punctuation marks and their contexts within citations and quotations

    New Auto-Interp
    Negative Logits
    lling
    -0.74
    bably
    -0.69
    arer
    -0.67
     sulph
    -0.65
     inadequ
    -0.63
     forgotten
    -0.63
     lifes
    -0.63
    theless
    -0.63
    oeuv
    -0.63
     concentration
    -0.63
    POSITIVE LOGITS
    BRE
    0.93
    KT
    0.90
    FOX
    0.89
    SCP
    0.88
    ................................................................
    0.86
    Anonymous
    0.86
    WF
    0.86
    Screenshot
    0.85
    WARNING
    0.85
    GREEN
    0.83
    Act Density 0.131%

    No Known Activations