INDEX
Explanations
phrases related to personal accounts or stories
references to personal experiences
Negative Logits
inately      -0.72
nda          -0.70
efficiency   -0.68
leaf         -0.68
laws         -0.67
nod          -0.67
vous         -0.66
pillar       -0.65
corn         -0.65
gem          -0.64
Positive Logits
experiences  0.99
experien     0.95
firsthand    0.95
Exper        0.87
experience   0.86
Experience   0.82
ually        0.82
iences       0.80
ional        0.75
Shape        0.72
Activations Density 0.036%