INDEX
Explanations
phrases or terms wrapped in quotation marks
phrases enclosed in quotation marks
Negative Logits
afar        -0.91
upon        -0.79
preceded    -0.78
merely      -0.77
mim         -0.76
summar      -0.75
elsewhere   -0.75
within      -0.74
mirrors     -0.74
simply      -0.74
Positive Logits
Golden      1.45
ultimate    1.43
worst       1.41
classic     1.35
little      1.33
dark        1.33
big         1.33
anti        1.32
gold        1.31
great       1.31
Activations Density: 0.076%