INDEX
    Explanations

    references to global events, particularly related to world wars

    New Auto-Interp
    Negative Logits
     InputDecoration
    -0.81
     desta
    -0.80
     Pilate
    -0.79
    Xie
    -0.78
     Pelosi
    -0.77
     Mortar
    -0.76
     Amtrak
    -0.75
     Cottages
    -0.73
     Ryanair
    -0.73
     Notary
    -0.73
    POSITIVE LOGITS
     World
    2.32
    World
    2.07
     WORLD
    2.04
     world
    1.95
    WORLD
    1.89
    world
    1.80
     Worlds
    1.61
    orld
    1.56
    Worlds
    1.49
     worlds
    1.48
    Act Density 0.044%

    No Known Activations