INDEX
Explanations
the word "**rename**" or variations of it
occurrences of the word "renounce" and its related forms
New Auto-Interp
Negative Logits
Crunch
-0.75
apult
-0.74
Crunch
-0.74
medium
-0.72
Tornado
-0.70
catch
-0.69
iott
-0.68
Medium
-0.67
Mechanical
-0.67
weights
-0.67
POSITIVE LOGITS
ren
4.08
rename
1.52
relinqu
1.45
Ren
1.41
disav
1.35
repud
1.34
Ren
1.34
ren
1.14
renamed
1.12
abol
1.11
Activations Density 0.019%