    Explanations

    words related to locations or proper nouns related to buildings/places

    Negative Logits
    Token       Logit
    "<bos>"     -1.05
    "lenmiş"    -0.74
    "/*++"      -0.74
    "lenir"     -0.67
    " would"    -0.67
    "<?"        -0.66
    " became"   -0.65
    " keep"     -0.65
    "public"    -0.65
    " accept"   -0.65
    Positive Logits
    Token       Logit
    "le"        2.00
    " mef"      1.74
    " effe"     1.71
    " illi"     1.68
    " sovere"   1.68
    " Intere"   1.67
    " socie"    1.65
    " maneu"    1.64
    " véhic"    1.64
    " erec"     1.63
    Activation Density: 0.202%

    No Known Activations