INDEX
Explanations
instances of the word "gain," indicating a focus on acquiring or increasing something
New Auto-Interp
Negative Logits
ordnung
-0.64
McDon
-0.62
Schlacht
-0.61
Corcoran
-0.61
dorp
-0.59
Cindy
-0.59
Brown
-0.58
Fred
-0.58
ئيس
-0.57
samo
-0.57
POSITIVE LOGITS
GAIN
1.28
Gains
1.19
gain
1.18
Gain
1.13
Gain
1.09
gains
1.07
gains
1.07
gain
1.05
gained
0.98
GAIN
0.98
Activations Density 0.014%