INDEX
    Explanations

    specific forms of adjectives and verbs related to qualities and characteristics

    New Auto-Interp
    Negative Logits
    +#+#
    -0.65
     חיצוניים
    -0.61
     vérit
    -0.60
     GENERATED
    -0.59
     geda
    -0.59
    PerformLayout
    -0.59
    rungsseite
    -0.57
    :✨
    -0.57
     godz
    -0.56
     terecht
    -0.56
    POSITIVE LOGITS
    <bos>
    0.77
    AsUp
    0.60
    +:+
    0.55
     ImGui
    0.54
    UnusedPrivate
    0.54
    xious
    0.54
    Rhestr
    0.52
     TextAppearance
    0.52
    Mechan
    0.51
    Trimethyl
    0.50
    Act Density 0.598%

    No Known Activations