INDEX
Explanations
verbs related to actions and qualities, but with a focus on negative actions and qualities
New Auto-Interp
Negative Logits
objective
-0.64
archment
-0.62
Window
-0.59
identification
-0.58
encyclopedia
-0.58
Nort
-0.58
adventurous
-0.57
Disclaimer
-0.57
cooler
-0.57
belonging
-0.57
POSITIVE LOGITS
ifies
1.44
izes
1.35
ulates
1.25
itates
1.25
wrote
1.25
uates
1.20
poses
1.15
iates
1.14
cedes
1.08
ounced
1.08
Activations Density 1.194%