INDEX
Explanations
phrases related to holiday celebrations
the word "rule" and its variations, indicating a focus on governance or guidelines
New Auto-Interp
Negative Logits
ITNESS
-0.76
enegger
-0.75
20439
-0.70
æŃ¦
-0.68
POL
-0.67
CHO
-0.66
UTC
-0.65
OR
-0.64
adam
-0.64
à¨
-0.64
POSITIVE LOGITS
ule
1.43
ules
0.93
cules
0.92
lette
0.83
pees
0.80
leted
0.78
pee
0.78
phrine
0.77
ploy
0.77
bum
0.76
Activations Density 0.003%