Explanations
proper nouns, specifically names, especially the name "Shah"
Negative Logits
inary        -0.70
bunny        -0.70
omn          -0.69
lear         -0.67
auxiliary    -0.66
gradient     -0.65
sentient     -0.64
VILLE        -0.62
fiber        -0.62
boro         -0.61
Positive Logits
Shah         4.00
Sharif       1.68
Hussain      1.59
Khan         1.53
Zah          1.48
Mohammad     1.48
Sultan       1.46
Shia         1.41
Sheikh       1.41
Ahmad        1.38
Activations Density 0.015%