INDEX
Explanations
people's names, particularly the name "Dylan"
mentions of the name "Dylan" and related figures or topics
New Auto-Interp
Negative Logits
rontal
-0.84
actionGroup
-0.83
spr
-0.71
llah
-0.70
satell
-0.69
ments
-0.65
WATCHED
-0.65
Magikarp
-0.63
extremes
-0.62
MENT
-0.60
POSITIVE LOGITS
Dylan
1.00
rolet
0.87
TAMADRA
0.83
hedral
0.83
seys
0.82
elta
0.79
Moran
0.78
ylan
0.78
Matthews
0.75
CLS
0.72
Activations Density 0.009%