Explanations
phrases related to asserting and maintaining a firm stance on a particular topic
Negative Logits
gone                -0.82
NetMessage          -0.67
sung                -0.65
xual                -0.63
posure              -0.62
ãĥ¼ãĥ«              -0.61
ãĥŃ                 -0.61
brance              -0.60
anon                -0.60
________________    -0.59
Positive Logits
ently               0.90
ially               0.83
antly               0.77
encies              0.72
enance              0.72
vehemently          0.72
oux                 0.70
iago                0.70
adherence           0.69
ively               0.69
Activations Density: 9.488%