Explanations
phrases related to consecutive occurrences or repeated actions
phrases indicating repetition over multiple years
Negative Logits
merce         -0.94
mathemat      -0.77
ascript       -0.75
metic         -0.73
omorphic      -0.72
ufact         -0.70
streng        -0.68
competence    -0.67
urate         -0.66
mosqu         -0.66
Positive Logits
dy            1.19
dies          1.05
row           1.00
Row           0.87
er            0.85
ser           0.85
feed          0.81
rows          0.78
boat          0.77
dale          0.75
Activations Density 0.006%
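
To make the density figure concrete, here is a minimal sketch. It assumes that "activations density" means the fraction of tokens on which the feature fires (activation above zero); the source does not define the term. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density = (number of active tokens) / (total tokens).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 tokens, of which 6 activate the feature.
acts = [0.0] * 100_000
for i in (10, 500, 2_000, 40_000, 75_000, 99_999):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density * 100:.3f}%")
```

Under that assumption, a density of 0.006% corresponds to roughly 6 active tokens per 100,000.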