INDEX
Explanations
terms related to increases or growth in various contexts
New Auto-Interp
Negative Logits
–
-0.69
’,
-0.65
B
-0.65
ar
-0.62
ja
-0.60
s
-0.59
mat
-0.59
’.
-0.59
ny
-0.58
t
-0.58
POSITIVE LOGITS
Increases
1.28
Increase
1.26
Increases
1.24
increase
1.21
Increase
1.20
UserScript
1.19
increase
1.17
myſelf
1.17
INCREASE
1.15
Efq
1.13
Activations Density 0.132%