INDEX
Explanations
phrases or words indicating purpose or rationale in a context
New Auto-Interp
Negative Logits
principalColumn
-0.55
GenerationType
-0.52
propOrder
-0.52
ScopeManager
-0.51
therefor
-0.50
Baillargeon
-0.49
pourtant
-0.48
addirittura
-0.48
šinou
-0.47
مشين
-0.47
POSITIVE LOGITS
convenience
1.32
ease
1.25
clarity
1.24
easier
1.24
simplicity
1.20
easier
1.17
Easier
1.02
readability
1.00
convenience
0.99
Easier
0.99
Activations Density 0.557%