INDEX
    Explanations

    attends to numerical values from associated units or metrics

    Head Attr Weights
    0:0.14
    1:0.12
    2:0.46
    3:0.05
    4:0.05
    5:0.04
    6:0.03
    7:0.06
    Negative Logits
     Ecke    -0.27
    -0.27
    ...    -0.27
    -0.27
    <eos>    -0.26
     Chwiliwch    -0.25
    -0.25
    :    -0.24
    how    -0.24
    ודה    -0.23
    Positive Logits
     AssemblyCompany    0.49
    makeConstraints    0.48
     arşivlendi    0.47
    numerusform    0.46
     Paglinawan    0.46
     continúas    0.44
    ArrowToggle    0.44
    NUMX    0.44
     FetchType    0.43
    astify    0.43
    Act Density 1.673%

    No Known Activations