INDEX
    Explanations

    mathematical expressions or equations

    New Auto-Interp
    Negative Logits
     termica
    -0.76
    rrggbb
    -0.68
     Agamemnon
    -0.64
     piatta
    -0.61
     Oboe
    -0.60
     taglia
    -0.59
     frontale
    -0.58
     coscienza
    -0.58
    odils
    -0.58
     ſtill
    -0.57
    POSITIVE LOGITS
     فريبيس
    0.83
    ')")
    0.73
    "])
    
    0.71
     désolés
    0.71
    )))));
    0.69
    )})
    0.69
    ")){
    
    0.67
    '){
    
    0.67
    })`
    0.65
    ()")
    0.65
    Act Density 0.078%

    No Known Activations