INDEX
Explanations
mentions of rivalries or conflicts
references to competition and rivalry
New Auto-Interp
Negative Logits
olia
-0.72
alam
-0.69
OST
-0.66
UX
-0.66
oglobin
-0.65
overe
-0.65
aughter
-0.65
istry
-0.64
umn
-0.64
uggage
-0.64
POSITIVE LOGITS
ries
1.35
rival
1.01
rivals
0.98
factions
0.86
challengers
0.85
competitor
0.79
competitors
0.77
contenders
0.75
bidder
0.73
rous
0.72
Activations Density 0.024%