INDEX
    Explanations

    words and phrases indicating generalization or typicality

    New Auto-Interp
    Negative Logits
    <bos>
    -1.40
    MigrationBuilder
    -0.79
    InstrumentedTest
    -0.79
     GenerationType
    -0.77
    LookAnd
    -0.76
    InitVars
    -0.76
    NameInMap
    -0.74
    IUrlHelper
    -0.70
    RTLD
    -0.69
     gyhoeddwyd
    -0.69
    POSITIVE LOGITS
     contemporain
    0.65
     pylint
    0.60
     compromis
    0.57
    revet
    0.54
     Finalmente
    0.54
     biologique
    0.53
     contemporaine
    0.53
     clot
    0.52
     Obl
    0.52
     fewest
    0.52
    Act Density 0.638%

    No Known Activations