INDEX
Explanations
proper names, specifically the name "Joel"
references to individuals, particularly those named Joel and Isaac
New Auto-Interp
Negative Logits
hedral
-0.83
DragonMagazine
-0.74
omething
-0.72
abolic
-0.72
ewater
-0.70
ĸļ
-0.69
hift
-0.68
xy
-0.67
meal
-0.67
lease
-0.67
POSITIVE LOGITS
sson
0.80
iflower
0.80
itic
0.75
ands
0.75
son
0.73
raham
0.72
Castro
0.72
umni
0.72
itably
0.71
McH
0.69
Activations Density 0.103%