INDEX
    Explanations

    instances of contrasting concepts or perspectives

    Negative Logits
    igar            -0.15
    .struts         -0.15
    avor            -0.15
    enk             -0.15
    loff            -0.15
    isté            -0.15
    mart            -0.14
    ena             -0.13
    alk             -0.13
    ÙĦÙģ            -0.13
    Positive Logits
    ********************************************************************************  0.15
    StandardItem    0.15
     tay            0.14
    andler          0.14
                    0.14
    ×               0.14
    ×Ļ×             0.13
    yleft           0.13
    Meteor          0.13
    772             0.13
    Act Density 0.000%

    No Known Activations