INDEX
Explanations
seek out words related to colors
references to the word "Turkey" or related terms
Negative Logits
Opportunity    -0.66
[&             -0.65
urance         -0.64
Integrity      -0.63
moons          -0.62
Unch           -0.60
newsletters    -0.60
Holmes         -0.59
Constructed    -0.59
Competitive    -0.58
POSITIVE LOGITS
moil     1.45
kish     1.24
geon     1.19
ban      1.16
bid      1.15
fing     1.12
rible    1.10
bul      1.10
keys     1.10
ismo     1.05
Activations Density 0.020%