INDEX
    Explanations

    expressions of ignorance or lack of knowledge

    New Auto-Interp
    Negative Logits
    amitié
    -0.61
    +:+
    -0.60
     amitié
    -0.59
     intStringLen
    -0.58
     dét
    -0.57
    WriteTagHelper
    -0.56
     blessé
    -0.56
    endphp
    -0.54
     planas
    -0.54
    dafx
    -0.52
    POSITIVE LOGITS
     ignorance
    2.72
     ignorant
    2.64
     Ignorance
    2.13
    orance
    1.29
     ignor
    1.26
    Ign
    1.08
    orant
    0.99
     igno
    0.92
     clueless
    0.91
     unaware
    0.87
    Act Density 0.001%

    No Known Activations