INDEX
Explanations
frequent mentions of the word "lots"
New Auto-Interp
Negative Logits
cref
-0.60
fap
-0.59
patties
-0.54
io
-0.50
Jah
-0.49
tout
-0.49
inchilla
-0.49
ք
-0.49
theit
-0.48
út
-0.48
POSITIVE LOGITS
lots
1.47
Lots
1.44
Lots
1.40
LOTS
1.16
lots
1.12
myſelf
0.78
ConstraintMaker
0.78
BufferException
0.78
loads
0.76
^(@)
0.75
Activations Density 0.035%