INDEX
Explanations
words related to value, representation, and development
terms related to valuation and representation
New Auto-Interp
Negative Logits
Feinstein
-0.69
rir
-0.67
Plex
-0.62
APTER
-0.61
Rowling
-0.60
Party
-0.60
erva
-0.58
Literary
-0.58
abiding
-0.58
liberating
-0.55
POSITIVE LOGITS
nesses
0.89
glers
0.79
glances
0.71
Opportun
0.67
(<
0.67
ealous
0.67
spoiled
0.67
expectations
0.65
stereotypes
0.64
WARE
0.63
Activations Density 0.093%