INDEX
Explanations
references to health-related concepts and terminology
technical or academic punctuation and citation markers.
New Auto-Interp
Negative Logits
ią
-0.60
Tur
-0.59
iſt
-0.59
headlong
-0.59
𝙫
-0.59
Houſe
-0.58
δες
-0.58
bigoplus
-0.58
שְׁ
-0.56
Syr
-0.56
POSITIVE LOGITS
,-,
1.42
.$,
1.28
′,
1.24
}}$,
1.23
++,
1.16
°,
1.16
,:),
1.14
€,
1.13
\%$,
1.13
.],
1.13
Activations Density 2.637%