INDEX
    Explanations

    specific names or identifiers in the text

    New Auto-Interp
    Negative Logits
     виправивши
    -0.71
     سكانية
    -0.69
     }],
    -0.66
    })));
    -0.66
     ujednoznacz
    -0.65
    ")));
    
    -0.62
    ]-->
    -0.61
    ')));
    -0.59
    '}>
    -0.59
     ""}
    -0.59
    POSITIVE LOGITS
    Uninitialized
    0.61
    RTEX
    0.55
     tadi
    0.52
     manners
    0.51
     kidding
    0.51
    uslar
    0.50
    MessageState
    0.49
     again
    0.49
     čia
    0.49
    nowu
    0.49
    Act Density 0.113%

    No Known Activations