INDEX
Explanations
words related to competition and conflict
discussions related to fairness in competitive situations
New Auto-Interp
Negative Logits
translation
-0.70
Reviewed
-0.65
ĸļ
-0.63
english
-0.61
respectively
-0.61
surprisingly
-0.61
Reading
-0.59
Tip
-0.59
REDACTED
-0.59
aired
-0.58
POSITIVE LOGITS
..."
1.19
),"
1.10
,"
1.10
â̦"
1.10
,'"
1.09
[
1.08
?"
1.05
)."
1.00
?'"
1.00
.")
0.99
Activations Density 1.233%