INDEX
Explanations
mentions of specific names and detailed technical information
Negative Logits
enegger        -1.08
enburg         -0.90
gow            -0.82
ãģ®éŃĶ         -0.79
iants          -0.78
beard          -0.72
omy            -0.70
Patreon        -0.69
worthiness     -0.68
Nanto          -0.67
Positive Logits
³              1.30
¦              1.14
¹              1.09
Enix           1.08
¿              1.08
µ              1.08
¥              1.02
¼              1.00
¨              0.99
¾              0.99
Activations Density: 0.769%
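
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number is commonly computed, assuming it means the fraction of sampled token positions on which the feature's activation is nonzero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from this dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all; the dashboard's exact definition may differ.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 10,000 token positions, 77 of them with nonzero activation.
sample = [0.0] * 9923 + [1.5] * 77
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.769% would mean the feature fires on fewer than 8 of every 1,000 tokens.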