INDEX
    Explanations

    proper nouns, particularly names of people and organizations

    New Auto-Interp
    Negative Logits
    оÑī
    -0.17
     Tone
    -0.15
    OperationException
    -0.15
     Wars
    -0.15
    Adv
    -0.15
    emm
    -0.15
     Rencontres
    -0.14
    inals
    -0.14
    abeth
    -0.14
    ears
    -0.14
    POSITIVE LOGITS
    à¹Ģà¸ķà¸Ńร
    0.17
    asse
    0.15
     Lafayette
    0.15
    assel
    0.15
    bris
    0.15
    brit
    0.14
    šť
    0.14
    รà¸ĵ
    0.14
    lv
    0.14
    óż
    0.14
    Act Density 0.051%

    No Known Activations