INDEX
Explanations
It activates on occurrences of the proper noun referring to the sports team (mentions of the Giants).
New Auto-Interp
Negative Logits
harma
-0.07
interviews
-0.06
Ire
-0.06
indigenous
-0.06
Vish
-0.06
ören
-0.06
pray
-0.06
IDD
-0.06
.GetValue
-0.06
rowth
-0.06
POSITIVE LOGITS
giants
0.17
Giants
0.16
Giant
0.13
giant
0.12
gigantic
0.08
Titan
0.08
Titans
0.08
titan
0.08
巨
0.07
Dodgers
0.07
Activations Density 0.005%