INDEX
Explanations
terms related to sports victories
instances of the word "win" and related terms in the context of victories
New Auto-Interp
Negative Logits
encyclopedia
-0.75
intestine
-0.71
sensit
-0.67
senal
-0.67
Journals
-0.65
bowel
-0.65
notor
-0.65
untarily
-0.61
erity
-0.60
umn
-0.59
POSITIVE LOGITS
nings
1.32
iem
1.01
now
1.00
throp
0.90
liness
0.87
athon
0.82
igan
0.80
watch
0.78
igans
0.78
ners
0.77
Activations Density 0.030%