INDEX
Explanations
phrases related to correct decisions or choices
the concept of "right" in various contexts related to decision-making or choices
New Auto-Interp
Negative Logits
ĸļ
-0.81
anned
-0.81
ushima
-0.72
cit
-0.71
limited
-0.67
Film
-0.64
ivism
-0.61
oute
-0.61
rict
-0.60
Featuring
-0.60
POSITIVE LOGITS
amount
1.10
kinds
1.06
kind
1.05
balance
1.00
combination
1.00
proportions
0.99
thing
0.96
eous
0.92
ones
0.92
mix
0.91
Activations Density 0.059%