INDEX
Explanations
punctuation and formatting elements in the text
New Auto-Interp
Negative Logits
Tur
-0.61
Tur
-0.59
Crus
-0.52
𝙫
-0.51
היש
-0.48
livejournal
-0.48
villaggio
-0.48
inės
-0.47
Ter
-0.46
Kết
-0.46
POSITIVE LOGITS
.$,
1.34
,:),
1.30
,-,
1.27
(",",1.26
,',
1.26
*,
1.24
,",
1.23
,,,
1.23
€,
1.23
{,1.22
Activations Density 3.925%