INDEX
    Explanations

    affirmative statements or confirmations

    New Auto-Interp
    Negative Logits
    -0.62
    yntaxException
    -0.60
     AppBundle
    -0.58
    IContainer
    -0.56
    tagHelperRunner
    -0.55
     Rhymes
    -0.54
    DeleteBehavior
    -0.53
    etera
    -0.52
    einem
    -0.51
     insuffisamment
    -0.49
    POSITIVE LOGITS
     False
    0.84
    False
    0.84
     believers
    0.84
     believer
    0.79
     True
    0.76
     TRUE
    0.73
    false
    0.71
    TRUE
    0.70
    stdbool
    0.67
     colors
    0.67
    Act Density 0.094%

    No Known Activations