INDEX
    Explanations

    references to a specific individual, likely in a legal or investigative context

    New Auto-Interp
    Negative Logits
    
    -0.61
     noqa
    -0.57
    orney
    -0.53
    ا
    -0.51
    DockStyle
    -0.51
    WriteLiteral
    -0.51
     мәкал
    -0.48
    MemoryWarning
    -0.48
    DebuggerNonUser
    -0.48
    subpackage
    -0.47
    POSITIVE LOGITS
    tralight
    0.63
    tral
    0.58
     szolg
    0.57
    sul
    0.56
     indisponible
    0.56
     Vul
    0.55
    livan
    0.54
     Lyt
    0.54
    Autoritní
    0.54
    mány
    0.53
    Act Density 0.178%

    No Known Activations