INDEX
Explanations
phrases indicating an action or impact
phrases indicating actions or states of being
Negative Logits
Columb     -0.64
rones      -0.63
eteenth    -0.63
igor       -0.62
comings    -0.61
Cub        -0.60
ston       -0.59
estern     -0.58
CLUD       -0.58
locked     -0.57
Positive Logits
create     1.22
educate    1.16
revise     1.15
remove     1.14
shorten    1.12
reduce     1.12
introduce  1.11
elevate    1.10
minimize   1.10
simplify   1.10
Activations Density 0.163%