Explanations
Python code, such as the definition of a class
Negative Logits
Token      Logit
myſelf     -1.04
―――――      -0.96
itſelf     -0.96
Majefty    -0.94
purpoſe    -0.93
Anſ        -0.87
ſtate      -0.87
étoit      -0.87
Eſ         -0.85
himſelf    -0.84
Positive Logits
Token             Logit
DoubleQuotes      0.82
matchCondition    0.81
omitempty         0.78
<eos>             0.78
function          0.77
__((              0.75
↵↵                0.74
function          0.73
func              0.67
__':              0.66
Activations Density: 0.222%