INDEX
    Explanations

    references to the Pentagon

    references to the Pentagon

    New Auto-Interp
    Negative Logits
    à
    -0.74
    lihood
    -0.71
    gger
    -0.71
    Hop
    -0.68
    cess
    -0.68
    Beer
    -0.68
     Constantin
    -0.66
    BOOK
    -0.65
    PER
    -0.65
    Ward
    -0.65
    POSITIVE LOGITS
     Pentagon
    1.07
     Papers
    0.92
    arium
    0.82
    ilion
    0.79
     brass
    0.77
    achev
    0.77
    selage
    0.77
     Plaza
    0.73
    eteria
    0.73
     contractors
    0.72
    Act Density 0.007%

    No Known Activations