INDEX
Explanations
phrases starting with special characters or symbols
special characters or symbols that appear repeatedly
New Auto-Interp
Negative Logits
awaru
-0.74
NEC
-0.73
photoc
-0.69
swept
-0.68
çīĪ
-0.66
fed
-0.65
anwhile
-0.65
Kob
-0.63
MET
-0.61
Shib
-0.61
POSITIVE LOGITS
¢
0.93
§
0.90
£
0.90
¬
0.89
¹
0.89
¿
0.87
Ĵ
0.86
¼
0.84
¦
0.83
º
0.83
Activations Density 0.294%