INDEX
    Explanations

    phrases related to uncertainty and confirmation

    phrases that indicate the state or condition of something

    Negative Logits
    "pmwiki"        -0.99
    "Parameters"    -0.80
    "SPONSORED"     -0.78
    "ieties"        -0.74
    "Marginal"      -0.72
    " lengths"      -0.71
    "*/("           -0.69
    "ité"           -0.69
    "è¦ļéĨĴ"        -0.68
    "estyles"       -0.68
    Positive Logits
    " genuine"      1.16
    " legit"        1.14
    " authentic"    1.13
    " real"         1.10
    " indeed"       1.08
    " true"         1.05
    " theirs"       1.01
    " actually"     1.00
    " fake"         0.97
    " hers"         0.96
    Act Density 0.276%

    No Known Activations