INDEX
Explanations
proper nouns related to news articles
New Auto-Interp
Negative Logits
fortun
-0.73
interf
-0.73
polyg
-0.71
decomp
-0.69
seiz
-0.69
photoc
-0.69
pastry
-0.69
Marketable
-0.69
shack
-0.68
çīĪ
-0.67
POSITIVE LOGITS
Ļ
1.42
¬
1.38
ħ
1.27
¤
1.26
Ĵ
1.23
ª
1.22
£
1.21
ĸ
1.20
¡
1.20
ı
1.18
Activations Density 1.552%