INDEX
    Explanations

    references to locations and events related to cultural significance

    New Auto-Interp
    Negative Logits
     Muk
    -0.15
    leck
    -0.15
    alance
    -0.15
     Rouge
    -0.14
    ismo
    -0.14
    .UnitTesting
    -0.14
     KA
    -0.14
    mour
    -0.13
    lesen
    -0.13
     Afr
    -0.13
    POSITIVE LOGITS
     Bog
    0.38
     Gu
    0.24
     bog
    0.23
     Columbia
    0.22
     Ðijог
    0.21
     Lima
    0.21
     Dominic
    0.20
     Zac
    0.20
     cub
    0.20
     Jal
    0.20
    Act Density 0.088%

    No Known Activations