INDEX
Explanations
words related to competition and superiority
instances of the word "out" in various contexts
Negative Logits
ious          -0.74
ULAR          -0.72
initiation    -0.66
goodbye       -0.66
basement      -0.66
hell          -0.66
counselling   -0.63
farewell      -0.61
compulsory    -0.61
bitters       -0.59
Positive Logits
value         1.27
strip         1.24
sell          1.23
values        1.22
per           1.21
match         1.21
do            1.20
stri          1.18
paced         1.13
shine         1.11
Activations Density: 0.057%