INDEX
Explanations
references to the name "Dylan" with varying degrees of relevance
mentions of specific names, particularly "Dylan"
New Auto-Interp
Negative Logits
spr
-0.87
rontal
-0.82
cryst
-0.69
enegger
-0.69
actionGroup
-0.68
Canary
-0.68
hips
-0.68
dayName
-0.67
streng
-0.66
satell
-0.65
POSITIVE LOGITS
Dylan
0.80
ylan
0.79
owe
0.77
rez
0.75
Ezra
0.74
Archdemon
0.74
Moran
0.74
rine
0.74
Matthews
0.73
onymous
0.72
Activations Density 0.031%