INDEX
Explanations
phrases related to urging or advising others to take certain actions
pronouns that refer to individuals or groups, particularly in contexts of urging or advising action
Negative Logits
Fried        -0.70
Siege        -0.69
fect         -0.68
Associated   -0.68
Assault      -0.64
Ange         -0.63
amer         -0.62
Atlantic     -0.61
assets       -0.61
People       -0.59
Positive Logits
undertake    0.82
personally   0.77
self         0.73
tailor       0.72
know         0.71
perform      0.69
atically     0.68
retain       0.68
flexibility  0.68
borrow       0.68
Activations Density 0.178%
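
The density figure above is commonly read as the fraction of token positions on which the feature fires at all. The page does not say how it was computed, so the following is only a minimal sketch under that assumption: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and chosen purely for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature is
    active. The source page does not state its exact definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 178 of them active.
sample = [0.0] * 99_822 + [1.5] * 178
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.178% means the feature is silent on the vast majority of tokens, which is typical of a sparse feature.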