INDEX
    Explanations

    structured information such as segment titles and section headings

    specific structured data or formatting markers

    New Auto-Interp
    Negative Logits
     neighb
    -0.71
     predec
    -0.68
     Vaugh
    -0.61
     administ
    -0.59
     surpr
    -0.59
     destro
    -0.59
     manif
    -0.58
     tradem
    -0.58
     Instr
    -0.58
     proble
    -0.57
    POSITIVE LOGITS
     Introduction
    0.86
    Introduction
    0.80
    [[
    0.75
     Conclusion
    0.74
    ³³³³
    0.73
    Joined
    0.71
     Quote
    0.70
    CHAPTER
    0.69
    =>
    0.68
    Offline
    0.67
    Act Density 0.301%

    No Known Activations