Explanations
punctuation marks
sentences ending with a period
Negative Logits
withd        -0.98
manif        -0.80
inver        -0.79
cabbage      -0.75
brut         -0.74
prototyp     -0.74
scaling      -0.73
onga         -0.70
challeng     -0.70
listeners    -0.70
Positive Logits
Retrieved               1.55
jpg                     1.46
Accessed                1.32
png                     1.25
htm                     1.17
txt                     1.12
(token text missing)    1.11
zip                     1.10
exe                     1.10
wav                     1.08
Activations Density 0.298%
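
The density figure above is not defined on this page. One common reading, assumed here, is the fraction of token positions on which the feature has a nonzero activation. A minimal sketch under that assumption follows; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and do not come from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 token positions, 3 of which fire.
sample = [0.0] * 997 + [0.4, 1.2, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.300%
```

Under this reading, a density of 0.298% would mean the feature fires on roughly 3 of every 1000 tokens.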