    Explanations

    proper nouns and specific references, particularly related to individuals and institutions

    Negative Logits
    й
    -0.94
     Bartol
    -0.89
    一个
    -0.89
     manufact
    -0.89
    __*/
    -0.89
    Gott
    -0.87
    IIIIIIII
    -0.82
    йки
    -0.82
     Baldwin
    -0.81
     Sapi
    -0.81
    Positive Logits
     soeur
    0.95
    0.93
     nationaux
    0.92
     suivie
    0.88
    ab
    0.85
     Lynd
    0.84
    ly
    0.83
    ag
    0.82
     sabbia
    0.82
     moeite
    0.79
    Activation Density: 2.158%

    No Known Activations