INDEX
Explanations
phrases related to decision-making and control
conjunctions and phrases indicating relationships or connections between ideas
Negative Logits
Token        Logit
REDACTED     -0.73
Closure      -0.72
Written      -0.70
TION         -0.68
iece         -0.68
ilateral     -0.68
imp          -0.67
ibl          -0.66
grave        -0.64
å°Ĩ          -0.63
Positive Logits
Token         Logit
pays          1.02
reap          1.01
accumulate    1.00
participates  0.99
participate   0.95
enjoy         0.93
populate      0.92
earn          0.92
inherit       0.92
regulate      0.92
Activations Density 0.656%
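
An activation density figure like the one above is commonly read as the fraction of sampled token positions on which the feature fires. The sketch below shows that calculation under that assumption. The function name, the zero threshold, and the sample values are illustrative and are not taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all sampled positions).
    """
    if len(activations) == 0:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: 1000 token positions, the feature fires on 3 of them.
sample = [0.0] * 1000
for position in (3, 250, 777):
    sample[position] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")
```

A density well under 1%, as reported here, would mean the feature is sparse: it is inactive on the large majority of tokens.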