INDEX
Explanations
phrases related to making changes or adjustments
phrases that indicate a reference to quantities or amounts
New Auto-Interp
Negative Logits
actionDate
-0.68
swer
-0.65
ifted
-0.62
ppers
-0.60
mbudsman
-0.59
envy
-0.59
masters
-0.58
î
-0.58
emergence
-0.57
planners
-0.57
POSITIVE LOGITS
aspects
0.81
ones
0.77
utations
0.76
existing
0.76
portion
0.72
thin
0.70
portions
0.70
unwanted
0.70
aciously
0.69
previously
0.69
Activations Density 0.323%