INDEX
    Explanations

    instances of detailed narration or descriptions in texts

    Comes after quotes or excerpts

    New Auto-Interp
    Negative Logits
    featureID
    -0.80
    ſelf
    -0.63
    ſtanding
    -0.61
    ftagPool
    -0.61
     queſta
    -0.60
     NSCoder
    -0.59
     Jefus
    -0.59
    GEBURTSDATUM
    -0.59
    astéroïdes
    -0.59
    
    -0.58
    POSITIVE LOGITS
     Gesch
    0.35
    nicy
    0.34
     courtesy
    0.32
     लग
    0.31
    ้า
    0.31
     Weit
    0.31
    AutoScaleMode
    0.30
     parte
    0.30
     done
    0.29
     Sch
    0.29
    Act Density 0.998%

    No Known Activations