INDEX
Explanations
references to locations or entities including the term "Penn."
mentions of the word "Penn"
Negative Logits
eer         -0.78
rall        -0.65
cerebral    -0.63
eering      -0.62
========    -0.61
balloons    -0.61
LEASE       -0.61
willful     -0.60
PRES        -0.60
Malays      -0.60
Positive Logits
sylv        1.48
sylvania    1.33
insula      1.00
iless       0.99
Penn        0.96
sburg       0.91
Penn        0.90
essee       0.88
olini       0.87
sat         0.87
Activations Density: 0.011%