Explanations
structured data formats or code-related constructs
Negative Logits
Token    Logit
-        -0.70
tur      -0.54
’        -0.53
–        -0.52
2        -0.51
Tur      -0.50
T        -0.50
—        -0.49
/        -0.48
w        -0.48
Positive Logits
Token         Logit
]")]          1.22
),),          1.08
للاسماء       1.07
}(),          1.07
</caption>    1.01
).}           0.99
↵             0.99
}),           0.99
GEBURTS       0.98
,:),          0.97
Activations Density: 0.177%