INDEX
    Explanations

    punctuation marks and references to historical figures and warfare

    New Auto-Interp
    Negative Logits
     McCart
    -0.15
     jes
    -0.15
    abaj
    -0.15
    ÙĦØ©
    -0.15
    reek
    -0.15
     Marr
    -0.14
     Traverse
    -0.14
    159
    -0.14
    156
    -0.14
    164
    -0.14
    POSITIVE LOGITS
     rede
    0.15
     Midlands
    0.15
    ĢìĿ´
    0.15
    anford
    0.14
     blot
    0.14
     Reagan
    0.14
    UnderTest
    0.14
     Pound
    0.14
    çĽ
    0.13
     Andersen
    0.13
    Act Density 0.024%

    No Known Activations