INDEX
Explanations
proper nouns, specifically names of Jason
repeated mentions of the name "Jason."
New Auto-Interp
Negative Logits
ship
-0.94
recomm
-0.83
Ñĭ
-0.74
sheets
-0.74
topic
-0.72
itionally
-0.71
sheet
-0.71
ships
-0.70
erest
-0.70
herry
-0.69
POSITIVE LOGITS
Bour
0.86
Garrett
0.80
Rubin
0.79
Aaron
0.79
Alexander
0.79
antine
0.79
Lev
0.78
Hann
0.75
Chaff
0.74
Kessler
0.74
Activations Density 0.010%