INDEX
Explanations
words related to the color green
references to the color green
Negative Logits
̶          -0.82
ebin       -0.77
ROR        -0.76
hematic    -0.67
rators     -0.67
orically   -0.65
gerald     -0.63
rator      -0.63
DCS        -0.62
itars      -0.62
Positive Logits
green      1.16
green      1.09
grass      0.96
leaf       0.94
peace      0.91
GREEN      0.87
Green      0.85
grass      0.84
wyn        0.84
GREEN      0.83
Activations Density: 0.010%