INDEX
Explanations
mentions of catalogs and their various forms
Negative Logits
<bos>         -0.48
Jeremy        -0.41
ujednoznacz   -0.41
surla         -0.40
Jeremy        -0.40
неде          -0.39
CURIAM        -0.38
...?          -0.38
bio           -0.38
biographer    -0.37
Positive Logits
Catalog       2.63
catalog       2.45
Catalog       2.39
catalog       1.98
CATALOG       1.91
CATALOG       1.84
catalogs      1.78
Catalogue     1.70
Catalogue     1.59
catalogue     1.57
Activations Density 0.002%