INDEX
    Explanations

    references to Italy and Italian culture or cuisine

    Negative Logits
    Token              Logit
    "cientos"          -0.70
    "xuan"             -0.64
    "mentaux"          -0.62
    "Poppy"            -0.60
    "</blockquote>"    -0.60
    " Schrader"        -0.60
    " Polskiego"       -0.59
    " Evan"            -0.59
    "udu"              -0.59
    "قيقة"             -0.58
    Positive Logits
    Token              Logit
    " Italy"           1.49
    "Italie"           1.37
    " Italians"        1.36
    "Italy"            1.36
    " italy"           1.33
    " Itali"           1.30
    " Italian"         1.29
    " ITALY"           1.24
    " Italien"         1.19
    " Italie"          1.18
    Activation Density: 0.084%

    No Known Activations