INDEX
    Explanations

    mentions of countries or nationalities

    New Auto-Interp
    Negative Logits
     kahit
    -0.60
     ļ
    -0.58
     adicionais
    -0.58
    NOWLED
    -0.58
     proszę
    -0.58
     froh
    -0.55
    ypeł
    -0.55
     mī
    -0.54
     izvē
    -0.54
    spania
    -0.53
    POSITIVE LOGITS
     liberality
    0.71
     Kün
    0.70
     Karsten
    0.69
     Schrö
    0.69
     Katrin
    0.65
     implacable
    0.64
     ingrat
    0.64
     Mathilde
    0.64
     Henk
    0.63
     Epif
    0.63
    Act Density 0.174%

    No Known Activations