INDEX
Explanations
phrases related to personal or professional achievements and attributes
key terms related to societal issues and change
Negative Logits
arton        -0.59
awaru        -0.57
umption      -0.56
asions       -0.54
cies         -0.54
ancies       -0.54
mentioned    -0.50
istries      -0.50
Wem          -0.49
nia          -0.49
Positive Logits
unto         0.66
fodder       0.63
centerpiece  0.57
underdog     0.56
sleeper      0.55
incarn       0.54
pinnacle     0.54
starter      0.53
conduit      0.52
ender        0.52
Activations Density 1.008%