INDEX
Explanations
sequences that are likely non-linguistic characters or encoding artifacts, potentially related to a specific type of coding or processing
special characters or non-standard symbols
New Auto-Interp
Negative Logits
Beir
-0.79
orkshire
-0.72
Kern
-0.71
Anat
-0.69
guiActiveUn
-0.68
bonded
-0.68
Stoke
-0.67
xus
-0.65
Brist
-0.65
clerosis
-0.64
POSITIVE LOGITS
ļ
1.40
¿
1.33
ij
1.24
Ī
1.22
¹
1.20
¼
1.19
¶
1.19
¤
1.18
ı
1.17
¬
1.17
Activations Density 0.003%