INDEX
Explanations
the color "gray" followed by nouns or descriptive terms associated with diverse contexts
references to the color gray
New Auto-Interp
Negative Logits
=-=-=-=-
-0.99
========
-0.83
ÄŁ
-0.80
etics
-0.78
Technical
-0.73
̶
-0.73
Witness
-0.73
itu
-0.72
ovie
-0.72
idem
-0.71
POSITIVE LOGITS
gray
1.15
grey
1.08
hound
1.03
gray
0.97
whale
0.92
beard
0.89
wolf
0.88
shading
0.86
pup
0.85
blur
0.85
Activations Density 0.008%