INDEX
Explanations
phrases indicating multiplication or increase
instances of the word "fold" used in various contexts
New Auto-Interp
Negative Logits
vae
-0.71
indo
-0.69
Vital
-0.67
Esper
-0.66
anwhile
-0.66
vil
-0.65
pez
-0.63
Lots
-0.62
Altern
-0.61
Julio
-0.60
POSITIVE LOGITS
fold
1.62
fold
1.25
ername
1.03
Fold
0.90
folded
0.89
folding
0.88
ers
0.79
theless
0.78
folds
0.72
shr
0.71
Activations Density 0.004%