INDEX
Explanations
references to the word "mint", likely focusing on the object or organization rather than the flavor
references to "mint" and related terms
New Auto-Interp
Negative Logits
Ake
-0.73
Incarn
-0.65
Adv
-0.64
Soup
-0.64
Methodist
-0.64
Airbnb
-0.63
RAW
-0.63
---------
-0.62
voc
-0.61
Magikarp
-0.60
POSITIVE LOGITS
Seym
1.40
ãĥ¼ãĥĨãĤ£
0.90
mint
0.87
ulously
0.83
marks
0.83
oufl
0.81
eer
0.80
rats
0.80
stones
0.80
mark
0.78
Activations Density 0.017%