INDEX
Explanations
words related to instructions or steps required to complete a task
Negative Logits
hey       -0.93
arer      -0.85
peak      -0.82
ĸļ        -0.79
gemony    -0.79
Cola      -0.78
lan       -0.78
arthed    -0.78
oche      -0.77
ruary     -0.76
Positive Logits
amount        1.16
paperwork     1.15
ingredients   1.11
components    1.04
permissions   1.04
materials     0.99
amounts       0.98
portions      0.96
portion       0.94
parts         0.93
Activations Density 0.084%