INDEX
Explanations
themes of fairness, opportunity, and change within competitive contexts
Negative Logits
ê¶Į
-0.15
igham
-0.15
Ambient
-0.14
Affected
-0.14
//{{
-0.14
ozor
-0.14
odzi
-0.14
itzer
-0.14
urd
-0.14
acker
-0.13
Positive Logits
ally
0.17
uniqueness
0.15
sing
0.15
atório
0.15
virgin
0.15
novelty
0.15
IQUE
0.14
ique
0.14
mark
0.14
ste
0.14
Activations Density 0.177%