Explanations
salt and pepper
Markers of highly structured or technical text (acronyms and abbreviations, tagging or format labels, numerals and dates, units, and code- or punctuation-heavy tokens) rather than ordinary prose.
Negative Logits
that  0.29
hẳn  0.27
(  0.26
что  0.25
άλλ  0.25
Emeritus  0.24
their  0.24
বিকল্প  0.24
Wellington  0.24
Communications  0.24
Positive Logits
ہار  0.32
など  0.31
definisi  0.31
។  0.31
پانچ  0.31
イベント  0.30
៧  0.30
letzte  0.30
제거  0.30
verfol  0.30
Activations Density 0.733%