INDEX
Explanations
phrases related to gaining or increasing something
Negative Logits
IALS         -0.66
Roberts      -0.64
Schröder     -0.61
als          -0.59
olato        -0.58
ordnung      -0.58
Stevenson    -0.58
Roberts      -0.57
me           -0.57
dorp         -0.56
Positive Logits
GAIN         1.52
Gain         1.45
Gains        1.44
gains        1.39
gain         1.37
gain         1.32
Gain         1.30
gained       1.30
gains        1.24
GAIN         1.21
Activations Density 0.074%