INDEX
Explanations
verbs related to reversing actions or outcomes
instances of the word "undo" and its variations, focusing on themes of reversal and retraction
New Auto-Interp
Negative Logits
asio
-0.87
ramid
-0.79
ategories
-0.77
croft
-0.77
tek
-0.77
colo
-0.75
chens
-0.74
rikes
-0.74
tom
-0.72
Flavoring
-0.71
POSITIVE LOGITS
undo
1.06
undone
0.88
havoc
0.80
undo
0.75
issance
0.66
erase
0.64
disarm
0.64
popul
0.64
detract
0.64
dismantling
0.64
Activations Density 0.010%