INDEX
Explanations
words related to conflicts or disputes, specifically using terms like "feud" and "rivalry."
terms related to conflicts or competitive relationships between individuals or groups
New Auto-Interp
Negative Logits
cise
-0.83
metic
-0.76
aza
-0.76
eared
-0.74
printed
-0.73
porting
-0.71
ascript
-0.70
argo
-0.69
akeru
-0.67
uggage
-0.65
POSITIVE LOGITS
feud
1.08
hips
0.88
rivalry
0.86
halla
0.79
rival
0.79
naire
0.77
SHIP
0.75
rivals
0.75
lords
0.71
disputes
0.67
Activations Density 0.018%