INDEX
    Explanations

    words related to cooling or temperature regulation

    the term "cool" and its variations, indicating a focus on temperature or popularity

    Negative Logits
    Token           Logit
    "glas"          -0.70
    " UNITED"       -0.66
    " Pengu"        -0.63
    " Starr"        -0.61
    "riage"         -0.61
    " PRESIDENT"    -0.61
    " Mand"         -0.60
    "lessly"        -0.60
    " Canaver"      -0.60
    "PLE"           -0.60
    Positive Logits
    Token           Logit
    "idge"          1.00
    "oola"          0.87
    "achine"        0.84
    "estone"        0.83
    "est"           0.81
    " factor"       0.80
    "pants"         0.79
    " breeze"       0.77
    "ness"          0.77
    "hens"          0.76
    Activation Density: 0.022%
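
The page does not define how the density figure is computed. A minimal sketch, assuming it is the percentage of token positions on which the feature's activation is above zero: under that reading, 0.022% corresponds to roughly 22 active positions per 100,000 tokens. The function name `activation_density` and the sample values are hypothetical, chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count positions where the feature is active.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Toy data (made up for illustration): one active position out of four.
sample = [0.0, 0.0, 0.5, 0.0]
print(activation_density(sample))
```

Returning a percentage rather than a fraction keeps the result directly comparable to the figure shown on the page.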

    No Known Activations