INDEX
    Explanations

    phrases that indicate varying conditions or categories

    New Auto-Interp
    Negative Logits
    <?
    -0.51
    berry
    -0.46
    ̍t
    -0.45
    IUrlHelper
    -0.45
     ^{-
    -0.44
    さえ
    -0.43
     &'
    -0.43
    ότητα
    -0.42
    letus
    -0.42
     stessi
    -0.42
    POSITIVE LOGITS
     different
    2.00
    different
    1.79
     varying
    1.78
    Different
    1.75
     Different
    1.68
     differing
    1.52
     unterschied
    1.50
     diferentes
    1.49
     DIFFERENT
    1.45
    不同的
    1.41
    Act Density 1.264%

    No Known Activations