INDEX
Explanations
occurrences of the word "example."
New Auto-Interp
Negative Logits
thschild
-0.55
𝖉
-0.51
McNally
-0.51
initis
-0.50
bordada
-0.49
ardından
-0.48
Potatoes
-0.47
ferrous
-0.47
𝖇
-0.46
noir
-0.46
POSITIVE LOGITS
example
1.23
Example
1.19
Example
1.18
example
1.17
EXAMPLE
1.09
examples
1.06
EXAMPLE
1.04
Examples
1.00
Examples
0.99
examples
0.96
Activations Density 0.028%