INDEX
Explanations
specific numerical data and punctuation used in structured formats or lists
New Auto-Interp
Negative Logits
disambiguazione
-0.62
帖最后由
-0.60
تقاوى
-0.59
Administrativna
-0.58
ویکیپدیا
-0.57
enterOuterAlt
-0.55
gyhoeddwyd
-0.52
annica
-0.50
contextLoads
-0.50
patr
-0.49
POSITIVE LOGITS
:✨
0.45
<bos>
0.39
AssemblyTitle
0.37
personalities
0.36
➽
0.35
はじめに
0.34
✨:
0.34
/\.
0.34
taas
0.34
hires
0.34
Activations Density 0.018%