INDEX
    Explanations

    punctuation marks and formatting indicators in text

    Negative Logits
     typelib            -1.03
    AndEndTag           -0.97
    WriteTagHelper      -0.88
     ddelweddau         -0.80
    IntoConstraints     -0.79
     OMITBAD            -0.78
     EconPapers         -0.77
     pinulongan         -0.76
    styleType           -0.75
    tangentMode         -0.75
    Positive Logits
    ↵↵                  0.42
     "..\..\            0.41
    AddWithValue        0.39
    ↵↵↵                 0.38
    @@@@@               0.38
     Flügel             0.38
    Ecotoxicity         0.37
    inheritDoc          0.36
     desig              0.35
                        0.35
    Activation Density 0.833%

    No Known Activations