Explanations
words related to competition or competitiveness
the term "compete" in various competition-related contexts
Negative Logits
Token        Logit
attribute    -0.72
ward         -0.69
haul         -0.66
anger        -0.63
sight        -0.62
hak          -0.62
Anth         -0.61
thus         -0.61
hod          -0.60
fab          -0.58
Positive Logits
Token          Logit
competing      0.86
competitions   0.82
ivity          0.82
estyles        0.81
compete        0.81
halla          0.78
competitive    0.77
competed       0.77
daq            0.76
competitors    0.75
Activations Density 0.008%
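
For readers unfamiliar with the density figure, the sketch below shows one common way such a number can be computed: the fraction of sampled tokens on which the feature's activation is above zero. This is an assumption about the definition, not something stated on this page. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only to reproduce a value of 0.008%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: a token counts as "active" when its activation
    is strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 8 active tokens out of 100,000 sampled tokens.
sample = [1.5] * 8 + [0.0] * 99_992
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.008%
```

Under this reading, a density of 0.008% means the feature fires on roughly 8 of every 100,000 tokens, which is consistent with a narrow feature tied to forms of "compete".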