INDEX
Explanations
various mentions of the word "selection", both simple and in specific contexts
instances of the word "selection."
New Auto-Interp
Negative Logits
peria
-0.77
kos
-0.76
wig
-0.72
mond
-0.67
pton
-0.65
hur
-0.64
alone
-0.63
FORMATION
-0.61
ned
-0.61
aths
-0.61
POSITIVE LOGITS
selection
1.13
criteria
0.91
selection
0.88
Selection
0.84
ivity
0.82
selections
0.81
eering
0.79
axe
0.78
azy
0.77
ively
0.76
Activations Density 0.021%