INDEX
Explanations
the beginning of sections or paragraphs in text
New Auto-Interp
Negative Logits
])+
-0.37
},
-0.35
-0.34
}
-0.34
s
-0.34
}}+
-0.34
])*
-0.34
)
-0.33
1
-0.33
}
-0.33
POSITIVE LOGITS
<bos>
0.78
Geſch
0.70
Weiſe
0.68
<unused32>
0.68
détect
0.67
<unused41>
0.67
<unused79>
0.67
<unused14>
0.67
<unused8>
0.67
[@BOS@]
0.67
Activations Density 0.330%