INDEX
    Explanations

    proper nouns, particularly names of people and places

    New Auto-Interp
    Negative Logits
    kontakte
    -0.17
    inus
    -0.16
    ActivityResult
    -0.15
    atus
    -0.14
    loquent
    -0.14
    ture
    -0.14
    emu
    -0.14
    oldt
    -0.14
    tick
    -0.13
    енÑĤа
    -0.13
    POSITIVE LOGITS
    opoulos
    0.33
    ou
    0.28
    akis
    0.28
     Pap
    0.27
    outs
    0.26
    ourg
    0.26
    oul
    0.26
     tou
    0.25
    iou
    0.25
    atos
    0.24
    Act Density 0.048%

    No Known Activations