INDEX
Explanations
words related to forceful actions or impacts
words related to dramatic or impactful actions
New Auto-Interp
Negative Logits
PF
-0.78
DOC
-0.70
ob
-0.67
broch
-0.63
science
-0.63
Ik
-0.63
Norway
-0.62
Frey
-0.62
ther
-0.61
amen
-0.61
POSITIVE LOGITS
ashing
4.17
ashed
2.79
ashes
2.70
ASH
2.03
ash
1.79
asher
1.35
attering
1.30
atching
1.25
usting
1.22
acking
1.11
Activations Density 0.005%