INDEX
Explanations
references to colors, particularly shades of yellow
mentions of the color yellow
New Auto-Interp
Negative Logits
rative
-0.88
enegger
-0.76
itia
-0.75
acters
-0.71
ichick
-0.70
weeney
-0.69
etimes
-0.69
ounter
-0.67
las
-0.65
ãĤ°
-0.65
POSITIVE LOGITS
Yellow
1.02
Fever
1.00
Jacket
0.91
knife
0.89
Voy
0.84
Jackets
0.83
Yellow
0.83
Route
0.82
Shirt
0.80
Matters
0.80
Activations Density 0.009%