INDEX
    Explanations

    specific terms and concepts related to a particular organization or classification system

    New Auto-Interp
    Negative Logits
    иÑĩеÑģÑĤво
    -0.25
     erste
    -0.24
    stvo
    -0.23
     weitere
    -0.23
     дÑĢÑĥгие
    -0.22
    еÑĩение
    -0.22
     andere
    -0.21
     ÑĸнÑĪÑĸ
    -0.21
     neue
    -0.21
     verschiedene
    -0.21
    POSITIVE LOGITS
    ного
    0.40
    ового
    0.36
    Ñģкого
    0.35
    енного
    0.35
    ÑĪего
    0.34
    кого
    0.33
    алÑĮного
    0.33
    оÑĩного
    0.32
    ÑģÑĤвенного
    0.31
    иÑĤелÑĮного
    0.31
    Act Density 0.053%

    No Known Activations