INDEX
Explanations
sympathy and emotional engagement
New Auto-Interp
Negative Logits
demonstrating
0.44
planning
0.44
upload
0.44
displaying
0.43
describes
0.43
conveys
0.43
uploading
0.42
exposing
0.41
planning
0.41
photography
0.41
POSITIVE LOGITS
sympathy
0.70
sympathetic
0.66
sympathies
0.66
सहानुभूति
0.64
sympathize
0.64
sympath
0.63
simpat
0.62
empath
0.59
empathetic
0.59
vic
0.57
Activations Density 0.009%