INDEX
    Explanations

    words end in -ing

    New Auto-Interp
    Negative Logits
    AnchorTagHelper
    -0.73
    born
    -0.71
    growing
    -0.63
     Growing
    -0.62
    InputBorder
    -0.62
    grown
    -0.59
     growing
    -0.56
     חיצוניים
    -0.56
    ROWN
    -0.54
    Growing
    -0.54
    POSITIVE LOGITS
    Personensuche
    0.56
    <bos>
    0.50
    0.48
    :✨
    0.46
    inine
    0.45
    SourceChecksum
    0.44
     ErrInvalid
    0.44
    uscular
    0.43
     lenker
    0.42
    évaluateur
    0.42
    Act Density 0.083%

    No Known Activations