INDEX
Explanations
repeated characters or symbols, particularly focusing on variations of accented letters
New Auto-Interp
Negative Logits
'\\;'
-0.94
》.
-0.89
*/}
-0.80
#:
-0.75
bezeichneter
-0.75
oa̍t
-0.74
tartalomajánló
-0.74
بوابة
-0.71
/*
-0.71
SequentialGroup
-0.70
POSITIVE LOGITS
â
2.05
â
1.90
Â
1.25
lâ
1.25
Â
1.24
Câ
1.18
lâ
1.09
Mâ
1.06
Bâ
1.05
câ
1.05
Activations Density 0.203%