INDEX
    Explanations

    words related to incorrectness or errors

    references to the concept of being incorrect or unjust

    New Auto-Interp
    Negative Logits
    hens
    -0.76
    enance
    -0.70
     composed
    -0.63
     Flavoring
    -0.63
     Pione
    -0.62
     Swim
    -0.61
     Fn
    -0.60
    kamp
    -0.59
     Crunch
    -0.59
    hips
    -0.59
    POSITIVE LOGITS
    headed
    1.08
    fully
    1.07
     unfocusedRange
    0.86
    sight
    0.85
    eous
    0.83
    behavior
    0.81
    ftime
    0.81
     assumptions
    0.80
     guiActiveUn
    0.80
    do
    0.79
    Act Density 0.032%

    No Known Activations