INDEX
Explanations
references to specific names, especially related to sports
mentions of specific athletes and figure prominently associated names
New Auto-Interp
Negative Logits
payer
-0.70
Mercury
-0.69
phas
-0.67
Gemini
-0.64
lucent
-0.64
eanor
-0.64
Salem
-0.63
Hunts
-0.62
jin
-0.60
Sax
-0.60
POSITIVE LOGITS
McGr
1.19
ath
0.91
acers
0.85
ength
0.84
untled
0.83
etz
0.81
aths
0.80
owship
0.80
iless
0.79
iddles
0.77
Activations Density 0.006%