INDEX
Explanations
the name "Taylor Swift"
references to the name "Taylor" and "Taylor Swift."
New Auto-Interp
Negative Logits
hemy
-0.83
pread
-0.77
ser
-0.77
oard
-0.75
sets
-0.72
rely
-0.70
sky
-0.69
onent
-0.68
cffffcc
-0.67
rated
-0.66
POSITIVE LOGITS
Swift
1.18
Made
1.05
©¶æ¥µ
0.76
Beckham
0.73
@#&
0.73
Joy
0.72
Hawkins
0.71
Kemp
0.68
edo
0.68
Powers
0.66
Activations Density 0.038%