INDEX
Explanations
technical or programming-related terms and structures
Code within class definitions
self. and method definitions
Negative Logits
Token    Logit
Hays     -0.70
о        -0.69
ho       -0.65
tetten   -0.65
Hickey   -0.63
qu       -0.62
l        -0.62
ou       -0.61
Bue      -0.60
r        -0.59
Positive Logits
Token                         Logit
                              0.88
itſelf                        0.82
myſelf                        0.79
leſs                          0.75
<h2>                          0.75
Hochspringen                  0.72
setVerticalGroup              0.72
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵    0.72
greateſt                      0.72
[toxicity=0]                  0.71
Activations Density 0.214%