INDEX
    Explanations

    companies, names, and details related to specific organizations or events

    New Auto-Interp
    Negative Logits
    <bos>
    -2.77
    
    
    -0.87
    -0.83
    SystemColors
    -0.74
    DataAnnotations
    -0.71
    <?
    -0.69
    -0.69
     Italijani
    -0.67
    font
    -0.67
    AllowUser
    -0.64
    POSITIVE LOGITS
     maneu
    1.81
     inev
    1.61
     increa
    1.55
     milano
    1.54
     impra
    1.53
     embra
    1.52
     depic
    1.52
     excru
    1.51
     indestru
    1.50
     emphat
    1.49
    Act Density 0.184%

    No Known Activations