Explanations
words related to cooling or temperature regulation
the term "cool" and its variations, indicating a focus on temperature or popularity
Negative Logits
glas        -0.70
UNITED      -0.66
Pengu       -0.63
Starr       -0.61
riage       -0.61
PRESIDENT   -0.61
Mand        -0.60
lessly      -0.60
Canaver     -0.60
PLE         -0.60
Positive Logits
idge        1.00
oola        0.87
achine      0.84
estone      0.83
est         0.81
factor      0.80
pants       0.79
breeze      0.77
ness        0.77
hens        0.76
Activations Density: 0.022%