Explanations
references to sports rivalries
Negative Logits
Token           Logit
ariat           -0.16
indr            -0.15
robe            -0.15
Porno           -0.15
lero            -0.14
ByPrimaryKey    -0.14
itz             -0.14
ait             -0.14
anja            -0.13
ÑĢог            -0.13
Positive Logits
Token           Logit
rivalry         0.34
rivals          0.22
rival           0.22
hatred          0.22
bitter          0.22
brag            0.21
riv             0.20
heated          0.20
intense         0.20
anim            0.20
Activations Density: 0.061%