INDEX
    Explanations

    attends to numerical values marked with ** from citation references marked with [[ ]]

    Head Attr Weights
    0:0.31
    1:0.26
    2:0.09
    3:0.05
    4:0.05
    5:0.07
    6:0.06
    7:0.07
    Negative Logits
    :✨
    -0.37
    AndEndTag
    -0.35
     defStyleAttr
    -0.35
     AttributeSet
    -0.34
     يتيمه
    -0.30
    Datuak
    -0.30
    ArgumentParser
    -0.29
    ernalia
    -0.27
    الدراسه
    -0.27
     JpaRepository
    -0.27
    Positive Logits
    ,-,
    0.26
    最快更新
    0.25
    tific
    0.24
    Wear
    0.24
    wieder
    0.24
    $/,
    0.24
     Wear
    0.24
    "{\
    0.23
    duction
    0.23
    mary
    0.23
    Act Density 0.075%

    No Known Activations