INDEX
    Explanations

    words related to tolerance and intolerance

    Negative Logits
    "s"                   -0.70
    "ethene"              -0.68
    (token not captured)  -0.66
    "BeforeClass"         -0.65
    "Higgs"               -0.64
    (whitespace)          -0.63
    "n"                   -0.62
    "$"                   -0.60
    " Biggs"              -0.60
    (whitespace)          -0.60
    Positive Logits
    " Toler"        1.69
    " tolerance"    1.60
    "Toler"         1.57
    " Tolerance"    1.51
    " tolerances"   1.47
    " tolerant"     1.46
    " toler"        1.46
    "toler"         1.36
    "tolerant"      1.34
    "olerance"      1.30
    Act Density 0.010%

    No Known Activations