INDEX
    Explanations

    instances of the word "one," especially when it emphasizes singularity or distinction in a context

    New Auto-Interp
    Negative Logits
    ivate
    -0.16
    acie
    -0.15
     object
    -0.15
    acja
    -0.15
    _ht
    -0.14
     Hercules
    -0.14
    prot
    -0.14
    stad
    -0.14
    si
    -0.14
    outh
    -0.14
    POSITIVE LOGITS
    ,readonly
    0.16
    .scalablytyped
    0.15
    ebek
    0.15
    475
    0.15
    ebra
    0.15
    åĭĴ
    0.14
    echan
    0.14
    ÌĨ
    0.14
    绩
    0.14
    porno
    0.14
    Act Density 0.058%

    No Known Activations