Explanations
verbs related to convincing someone
Negative Logits
Token         Logit
owment        -0.73
practice      -0.70
eworthy       -0.69
alon          -0.69
alm           -0.69
abytes        -0.68
ESV           -0.67
Occupations   -0.66
bard          -0.66
nam           -0.66
Positive Logits
Token         Logit
tale          0.88
ingly         0.87
ments         0.82
reluctant     0.80
convinc       0.79
skeptics      0.78
MENT          0.78
ment          0.76
convince      0.76
MENTS         0.73
Activations Density: 0.072%