INDEX
    Explanations

    phrases indicating division or categorization of items

    Negative Logits
    Token         Logit
    "kal"         -0.18
    "soever"      -0.16
    "ÑĢал"        -0.16
    "-FIRST"      -0.15
    "éĽij"        -0.15
    "immel"       -0.15
    "ÅĻeb"        -0.15
    "ieri"        -0.15
    "bast"        -0.14
    "bote"        -0.14
    Positive Logits
    Token         Logit
    ".rmi"        0.15
    "rtle"        0.14
    " notices"    0.14
    "pix"         0.14
    "AsStream"    0.14
    "DEPTH"       0.14
    "ANS"         0.14
    " two"        0.14
    "olia"        0.14
    " McD"        0.13
    Activation Density 0.010%

    No Known Activations