INDEX
Explanations
words related to psychological qualities or states like aggressiveness, obsessiveness, or inclusiveness
terms related to different forms of effectiveness or impact
New Auto-Interp
Negative Logits
canon
-0.71
mill
-0.59
accident
-0.59
wells
-0.59
baker
-0.59
MILL
-0.58
Remix
-0.58
bi
-0.57
corn
-0.56
canonical
-0.55
POSITIVE LOGITS
iveness
4.76
ively
2.48
ivity
2.46
ivism
1.95
ives
1.91
ivities
1.83
ive
1.58
eness
1.51
ibility
1.46
uality
1.44
Activations Density 0.008%