INDEX
Explanations
occurrences of the word "each" followed by a number, emphasizing individual instances or quantities
New Auto-Interp
Negative Logits
haven
-0.73
tops
-0.72
rovers
-0.72
stones
-0.71
children
-0.68
abs
-0.68
models
-0.65
marks
-0.63
bows
-0.62
letters
-0.62
POSITIVE LOGITS
successive
1.24
iteration
1.02
individual
0.97
individually
0.95
respective
0.91
succeeding
0.84
element
0.83
participant
0.81
ounce
0.81
side
0.81
Activations Density 0.299%