INDEX
    Explanations

    the beginning of a text or document

    New Auto-Interp
    Negative Logits
    '),
    
    -0.67
    "]
    
    -0.66
    ".
    
    -0.64
    '],
    
    -0.64
     ')
    
    -0.63
    )))
    
    -0.62
     ')
    -0.62
    GEBURTS
    -0.62
    "]),
    -0.62
    "]));
    -0.61
    POSITIVE LOGITS
     Roskov
    0.86
     himself
    0.77
     herself
    0.76
     himſelf
    0.68
     NSCoder
    0.65
    SBATCH
    0.65
    afficheront
    0.65
     GenerationType
    0.64
     TestBed
    0.63
     ***!
    0.63
    Act Density 0.334%

    No Known Activations