INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <bos>
    -0.61
    sel
    -0.46
    mann
    -0.46
    gel
    -0.45
    yo
    -0.45
    -0.45
    zul
    -0.44
     ...
    -0.43
    jar
    -0.42
    sist
    -0.42
    POSITIVE LOGITS
     Efq
    0.72
    AddTagHelper
    0.61
     وتسجيلات
    0.60
     ferons
    0.59
    \{\\
    0.58
    AutoresizingMask
    0.57
    /**
    0.57
    MemoryWarning
    0.57
     autorytatywna
    0.56
    #
    0.56
    Act Density 0.190%

    No Known Activations