INDEX
Explanations
formatting elements and structural markers within the text
New Auto-Interp
Negative Logits
++
-0.45
<eos>
-0.45
ได้
-0.43
while
-0.43
fine
-0.43
rrggbb
-0.42
vrienden
-0.42
✭✭
-0.42
é
-0.41
szczegó
-0.41
POSITIVE LOGITS
صوتيه
0.71
findpost
0.66
AssemblyTitle
0.64
مصادر
0.62
tagHelperRunner
0.62
للمعارف
0.61
StoreMessageInfo
0.61
bibinfo
0.60
стаття
0.59
विश्वसनीयता
0.59
Activations Density 0.019%