INDEX
Explanations
adjectives describing effort, skill, emotion, or physical state
emotional or negative descriptors related to experiences and actions
Negative Logits
Token        Logit
THREE        -0.63
Aren         -0.62
TWO          -0.61
Tanz         -0.61
Empires      -0.61
Edison       -0.60
Leviathan    -0.56
Eston        -0.55
slice        -0.55
Waters       -0.54
Positive Logits
Token        Logit
ful          2.22
fully        2.09
fulness      1.72
lessly       1.67
less         1.66
full         1.64
FUL          1.44
ingly        1.43
lessness     1.38
ously        1.37
Activations Density: 0.335%