INDEX
Explanations
phrases indicating individual items or entities within a broader context
occurrences of the word "Each"
New Auto-Interp
Negative Logits
abs
-0.79
tops
-0.74
rovers
-0.73
haven
-0.72
dn
-0.69
duc
-0.68
Chain
-0.66
children
-0.66
ables
-0.65
stones
-0.65
POSITIVE LOGITS
successive
1.19
iteration
0.98
individual
0.88
participant
0.87
person
0.84
where
0.83
month
0.83
piece
0.81
member
0.80
year
0.78
Activations Density 0.038%