Explanations
mentions of the University of Notre Dame
references to "Notre Dame" and related entities
Negative Logits
iegel    -0.67
oid      -0.63
served   -0.62
ulated   -0.62
USH      -0.61
owing    -0.61
ISTER    -0.60
tered    -0.60
oids     -0.60
ourke    -0.59
Positive Logits
Dame              1.63
Qiao              0.88
Coach             0.85
Fighting          0.82
mson              0.81
Lauder            0.81
Jeanne            0.81
Bride             0.80
Decoder           0.78
................  0.77
Activations Density 0.007%