INDEX
    Explanations

    numeric values associated with various contexts or categories

    New Auto-Interp
    Negative Logits
    cheid
    -0.15
    ลาย
    -0.15
     hero
    -0.15
    alim
    -0.15
    riott
    -0.15
    kir
    -0.15
    zym
    -0.14
    Hero
    -0.14
    rip
    -0.14
    itors
    -0.14
    POSITIVE LOGITS
     å¹
    0.17
     Facilities
    0.17
    azzi
    0.16
    thrown
    0.16
    ÑĦа
    0.16
     FAC
    0.15
    çľ
    0.15
     fac
    0.15
     sink
    0.15
    iku
    0.15
    Act Density 0.025%

    No Known Activations