INDEX
    Explanations

    references to locations and geographical features

    New Auto-Interp
    Negative Logits
     Roma
    -0.15
    ossa
    -0.15
     Bucc
    -0.15
     ваÑĢ
    -0.14
    ubb
    -0.14
    edException
    -0.14
     Gladiator
    -0.14
     roma
    -0.14
    isses
    -0.14
     Illum
    -0.13
    POSITIVE LOGITS
     eta
    0.26
    antz
    0.25
    itz
    0.24
    ertz
    0.24
     tx
    0.24
     Tx
    0.24
    ako
    0.23
    tx
    0.23
    aren
    0.23
     Bil
    0.22
    Act Density 0.005%

    No Known Activations