Explanations
references to the concept of selection in various contexts
selection contexts
Negative Logits
Token          Logit
tvrt           -0.54
psychiatrist   -0.54
llorar         -0.52
OCCURRED       -0.48
namorado       -0.48
Frat           -0.48
sidemargin     -0.48
Gesundheits    -0.48
embarazada     -0.47
<bos>          -0.47
Positive Logits
Token          Logit
selection      2.06
Selection      2.02
selection      1.83
Selection      1.80
SELECTION      1.66
selections     1.59
SELECTION      1.52
Selections     1.46
sélection      1.37
selections     1.37
Activations Density: 0.011%