INDEX
    Explanations

    references to American identity and cultural elements

    New Auto-Interp
    Negative Logits
    istrovstvÃŃ
    -0.18
    ÑģÑı
    -0.16
    ey
    -0.16
    anim
    -0.14
    piring
    -0.14
    ochen
    -0.14
    APER
    -0.14
    pedia
    -0.14
    amen
    -0.14
    ings
    -0.14
    POSITIVE LOGITS
    ized
    0.19
     Samoa
    0.19
    ization
    0.17
    arily
    0.17
    isation
    0.16
    ERICA
    0.15
    ERICAN
    0.15
    ised
    0.15
    onitor
    0.14
     BirleÅŁik
    0.14
    Act Density 0.057%

    No Known Activations