INDEX
    Explanations

    technical descriptions or explanations

    the word "describe" and its variations, indicating descriptions or explanations of concepts and phenomena

    New Auto-Interp
    Negative Logits
    ild
    -0.78
    é¾įå
    -0.74
    osate
    -0.70
    uyomi
    -0.69
    youtube
    -0.68
    Tickets
    -0.68
    Lex
    -0.68
    ©¶æ¥µ
    -0.67
    assic
    -0.67
    lua
    -0.66
    POSITIVE LOGITS
     how
    1.20
     aspects
    1.02
     what
    0.98
     everything
    0.96
     behaviors
    0.89
    enance
    0.87
     exactly
    0.86
     anything
    0.86
     situations
    0.83
     behaviours
    0.82
    Act Density 0.208%

    No Known Activations