INDEX
Explanations
references to specific individuals, particularly the name "Dana" and its variants in various contexts
mentions of the name "Dana" in various contexts
New Auto-Interp
Negative Logits
*/(
-0.74
tered
-0.69
eering
-0.67
invari
-0.67
inent
-0.66
blem
-0.66
asking
-0.65
GoldMagikarp
-0.63
ailable
-0.63
itudinal
-0.63
POSITIVE LOGITS
iesel
0.82
Scully
0.81
Vin
0.80
eger
0.76
Brooke
0.75
uala
0.75
her
0.74
uin
0.73
ei
0.72
illus
0.72
Activations Density 0.049%