INDEX
Explanations
structured data formats or variables used in programming
New Auto-Interp
Negative Logits
even
-0.52
now
-0.44
Pued
-0.43
<eos>
-0.43
-0.42
Even
-0.39
ToReturn
-0.39
even
-0.38
terle
-0.38
käyt
-0.36
POSITIVE LOGITS
propOrder
1.06
الحره
1.05
Signalez
1.00
:✨
0.99
صوتيه
0.97
للمعارف
0.96
Италијани
0.92
InjectAttribute
0.92
ostavi
0.90
хьтан
0.89
Activations Density 0.327%