INDEX
Explanations
mentions of specific names, particularly last names
mentions of specific individuals, particularly those with the last name Reynolds
New Auto-Interp
Negative Logits
nesses
-0.98
itarian
-0.90
ness
-0.89
hip
-0.82
ian
-0.76
ingly
-0.76
inations
-0.75
¢
-0.74
ians
-0.73
ansas
-0.72
POSITIVE LOGITS
OPLE
0.86
McF
0.78
ISTER
0.75
isters
0.71
CLASSIFIED
0.70
tremend
0.67
cavity
0.64
conclud
0.63
issions
0.61
adden
0.61
Activations Density 0.133%