INDEX
Explanations
phrases describing intentions or plans involving specific actions
phrases that involve conditions, consequences, or actions related to policies and governance
New Auto-Interp
Negative Logits
ãĤ¦ãĤ¹
-0.69
quished
-0.68
urnal
-0.66
Selected
-0.63
DX
-0.61
uber
-0.61
ĻĤ
-0.60
raved
-0.60
awed
-0.60
itarian
-0.59
POSITIVE LOGITS
thereby
1.21
eliminate
1.16
lest
1.15
preferably
1.13
abolish
1.11
stabilize
1.11
dismantle
1.08
consolidate
1.08
establish
1.04
weaken
1.04
Activations Density 0.319%