INDEX
    Explanations

    attends to various institution-related tokens from a range of institution-related tokens

    New Auto-Interp
    Head Attr Weights
    0:0.07
    1:0.15
    2:0.07
    3:0.08
    4:0.22
    5:0.21
    6:0.07
    7:0.08
    Negative Logits
     ujednoznacz
    -0.41
    __*/
    -0.38
    NUMX
    -0.35
    haustible
    -0.34
     cherchés
    -0.33
     hambre
    -0.32
     precisione
    -0.32
    rshire
    -0.31
    enumi
    -0.31
     فريبيس
    -0.31
    POSITIVE LOGITS
    Билгалдахарш
    0.29
     Mangan
    0.26
    awtextra
    0.26
     déb
    0.25
    copi
    0.25
    ایت
    0.24
    angor
    0.23
    Referensi
    0.23
    脚注
    0.23
    GeneratedCode
    0.23
    Act Density 0.126%

    No Known Activations