INDEX
    Explanations

    references to World War I and World War II

    New Auto-Interp
    Negative Logits
    ities
    -0.17
    ctl
    -0.17
    ife
    -0.15
    apture
    -0.15
     Victorian
    -0.14
     older
    -0.14
    ilar
    -0.14
    سÙĬÙĨ
    -0.14
    etti
    -0.14
     Downing
    -0.14
    POSITIVE LOGITS
    -era
    0.33
     era
    0.26
    era
    0.24
     Era
    0.22
    ERA
    0.18
    -period
    0.17
     dönemde
    0.17
    å½¹
    0.17
    ucene
    0.16
    hamster
    0.16
    Act Density 0.032%

    No Known Activations