INDEX
    Explanations

    words related to the concept of "kind" or "kindness."

    New Auto-Interp
    Negative Logits
    cifix
    -1.09
     Roach
    -0.95
     Matos
    -0.90
    Дереккөздер
    -0.89
    ectomy
    -0.87
     Rossa
    -0.87
     Jagger
    -0.85
    bleven
    -0.85
     חיצוניים
    -0.82
    décoration
    -0.82
    POSITIVE LOGITS
     Kind
    1.51
     KIND
    1.47
    kind
    1.45
     kind
    1.43
    Kind
    1.40
    KIND
    1.37
    Kinds
    1.18
    kinds
    1.11
     Kinds
    1.09
     kinds
    1.08
    Act Density 0.066%

    No Known Activations