INDEX
Explanations
numerals and technical symbols
special characters or symbols, particularly the character "¸"
New Auto-Interp
Negative Logits
ses
-0.81
rants
-0.75
nings
-0.73
eatures
-0.73
Flavoring
-0.71
ancies
-0.70
icals
-0.70
zona
-0.69
synerg
-0.67
detail
-0.67
POSITIVE LOGITS
ãĤ§
1.10
ãĥ£
0.99
Ö¼
0.94
ãĥ¥
0.92
256
0.89
¸
0.88
Ö
0.86
wark
0.83
¾
0.82
Ì
0.81
Activations Density 0.011%