INDEX
Explanations
punctuation and formatting in code or structured text
Negative Logits
Token       Logit
Nakamura    -0.67
w           -0.62
netbeans    -0.60
hu          -0.58
Ron         -0.56
cheid       -0.56
z           -0.56
zea         -0.55
ib          -0.55
Cle         -0.55
Positive Logits
Token       Logit
]")]        1.61
}")]        1.48
__":        1.38
.")]        1.36
__':        1.21
$")         1.21
}</         1.17
)";         1.17
')")        1.17
})*/        1.16
Activations Density 0.047%