INDEX
    Explanations

    proper nouns related to individuals or organizations

    NEGATIVE LOGITS
                 -0.75
     d           -0.72
    ...          -0.72
     her         -0.72
     no          -0.72
     a           -0.71
     so          -0.70
     la          -0.70
     to          -0.70
     de          -0.69
    POSITIVE LOGITS
     milano       2.29
     cannes       2.16
     bandung      2.01
     tanga        1.99
     napoli       1.96
     marte        1.96
     lele         1.96
     sergio       1.95
     jorge        1.94
     casio        1.94
    Activation Density: 0.352%

    No Known Activations