INDEX
    Explanations

    an indicator marking the beginning of a new section or paragraph in a text

    Negative Logits
    AndEndTag           -0.85
    mybatisplus         -0.74
    ThroughAttribute    -0.72
     EconPapers         -0.70
    ViewFeatures        -0.70
     виправивши         -0.70
    RenderAtEndOf       -0.69
    ardless             -0.68
    ConstraintMaker     -0.67
     disambiguazione    -0.66
    Positive Logits
    dagogik             0.53
    ...                 0.48
     beautiful          0.48
     immaculate         0.46
     BBQ                0.44
    !                   0.43
     V                  0.42
    ıll                 0.42
    ございません              0.42
    czę                 0.42
    Act Density 0.041%

    No Known Activations