INDEX
    Explanations

    names, especially surnames and titles related to individuals

    New Auto-Interp
    Negative Logits
    andum
    -0.07
    ERRU
    -0.06
    issan
    -0.06
    ã
    -0.06
     tam
    -0.06
    nis
    -0.06
    -в
    -0.06
    tam
    -0.06
     Robert
    -0.06
    onders
    -0.06
    POSITIVE LOGITS
    _IGNORE
    0.08
    tle
    0.08
    tered
    0.08
    enberg
    0.08
    aversal
    0.07
    skin
    0.07
    eatures
    0.07
    /repos
    0.07
    erman
    0.07
    acter
    0.07
    Act Density 0.004%

    No Known Activations