INDEX
    Explanations

    references to abstract concepts or unspecified ideas

    New Auto-Interp
    Negative Logits
    ¥
    -3.72
    Ļª
    -3.69
    §
    -3.21
    ¬
    -3.19
    µ
    -3.17
    Ń
    -3.13
    ·
    -3.12
    Ī
    -3.12
    ¿½
    -3.02
    ĺ
    -3.01
    POSITIVE LOGITS
     else
    2.83
     resembling
    2.01
     productive
    1.87
     ELSE
    1.81
     like
    1.73
    Else
    1.73
     positive
    1.60
     akin
    1.58
     acidic
    1.55
     constructive
    1.55
    Act Density 0.100%

    No Known Activations