INDEX
Explanations
references to the color blue and its various shades or associations
New Auto-Interp
Negative Logits
makeConstraints
-0.51
ţiile
-0.46
țiile
-0.43
minator
-0.43
küche
-0.42
dź
-0.41
lumineuse
-0.40
ashian
-0.40
Foire
-0.40
weights
-0.40
POSITIVE LOGITS
blue
1.84
Blue
1.80
Blue
1.74
BLUE
1.72
blue
1.68
BLUE
1.67
blues
1.33
蓝
1.27
ブルー
1.23
mavi
1.22
Activations Density 0.117%