INDEX
    Explanations

    phrases related to importance or significance

    Negative Logits
    Token               Logit
    " Winaray"          -0.53
    "########."         -0.52
    " <<<<<<<<<<<<<<"   -0.48
    "EndContext"        -0.47
    " ProtoMessage"     -0.45
    "Larger"            -0.43
    "\{\\"              -0.43
    ":✨"                -0.42
    " متعلقه"           -0.41
    "Coarse"            -0.40

    Positive Logits
    Token               Logit
    " great"            1.77
    "great"             1.33
    " immense"          1.14
    " tremendous"       1.13
    " particular"       1.13
    " extreme"          1.12
    " enormous"         1.08
    " GREAT"            0.98
    " considerable"     0.98
    " utmost"           0.92
    Act Density 0.630%

    No Known Activations