INDEX
Explanations
mentions related to the name "Prithvi" at different activations
references to a particular individual or character
New Auto-Interp
Negative Logits
spirited
-0.71
loud
-0.67
localization
-0.67
LEG
-0.66
goodbye
-0.65
rake
-0.63
couch
-0.61
bed
-0.61
eers
-0.61
savior
-0.60
POSITIVE LOGITS
udence
1.41
atche
1.23
ussia
1.23
ussian
1.22
ima
1.19
imum
1.15
imes
1.14
arie
1.13
ingle
1.13
imate
1.12
Activations Density 0.013%