INDEX
    Explanations

    references to organizations, government programs, and legal terms

    New Auto-Interp
    Negative Logits
    <bos>
    -0.93
     a
    -0.74
     (
    -0.73
     y
    -0.70
     int
    -0.69
     ten
    -0.68
     t
    -0.68
     а
    -0.67
     from
    -0.67
    相对
    -0.67
    POSITIVE LOGITS
     silikon
    2.15
     alkoh
    2.05
     kram
    2.05
     hcm
    2.05
     aen
    2.01
     dises
    1.99
     keramik
    1.98
     mef
    1.96
     meis
    1.95
     fta
    1.94
    Act Density 0.143%

    No Known Activations