INDEX
    Explanations

    numeric comparisons or math-related expressions

    New Auto-Interp
    Negative Logits
    glers
    -0.49
     Zel
    -0.47
    وء
    -0.46
    onn
    -0.45
     respectively
    -0.44
    respectively
    -0.42
    ghijkl
    -0.42
     cu
    -0.42
    eding
    -0.42
    lei
    -0.41
    POSITIVE LOGITS
     étrangère
    0.82
     sauvages
    0.78
     aveug
    0.76
     étrangères
    0.73
     étranger
    0.71
     fermés
    0.70
    BufferException
    0.69
     Lightboxes
    0.69
    rrggbb
    0.68
     africaine
    0.68
    Act Density 0.013%

    No Known Activations