INDEX
Explanations
a specific character encoding or language pattern
characters from non-Latin scripts or possibly non-readable symbols
New Auto-Interp
Negative Logits
board
-0.81
merch
-0.69
cheer
-0.68
panels
-0.68
Bree
-0.67
cooler
-0.67
apparel
-0.66
Franch
-0.66
favor
-0.64
opportunity
-0.63
POSITIVE LOGITS
²
1.84
º
1.78
½
1.72
´
1.71
±
1.69
¾
1.69
¸
1.68
³
1.66
Ń
1.62
°
1.55
Activations Density 0.024%