INDEX
Explanations
occurrences of the word "single" in various contexts
Negative Logits
loor     -0.16
rang     -0.15
jang     -0.15
atz      -0.14
reluct   -0.14
kal      -0.14
ico      -0.14
la       -0.14
355      -0.14
bling    -0.14
Positive Logits
tons      0.25
/single   0.22
-handed   0.17
ipa       0.16
tones     0.16
GEST      0.15
sti       0.15
stin      0.15
emean     0.14
еди       0.14
Activations Density: 0.025%