INDEX
Explanations
words related to specific names, particularly "Mayer"
references to specific individuals and organizations
New Auto-Interp
Negative Logits
ivity
-0.74
rend
-0.74
verted
-0.65
away
-0.62
acters
-0.61
ifts
-0.61
claimed
-0.61
ordinate
-0.60
microwave
-0.60
MacArthur
-0.60
POSITIVE LOGITS
hattan
0.85
useum
0.85
ONEY
0.83
emonic
0.77
ixture
0.76
arijuana
0.75
Mayhem
0.75
ORPG
0.74
HAEL
0.73
achine
0.73
Activations Density 0.190%