INDEX
Explanations
proper nouns related to individuals, particularly the name "Warren" with varying strengths of activation
mentions of the name "Warren."
New Auto-Interp
Negative Logits
dayName
-0.87
pmwiki
-0.78
lies
-0.74
netflix
-0.73
ãĤ´ãĥ³
-0.71
liness
-0.70
etheless
-0.65
cific
-0.65
odic
-0.64
ly
-0.64
POSITIVE LOGITS
Buffett
1.41
sburg
1.12
Farrell
0.96
rade
0.91
Sapp
0.91
Harding
0.89
Buff
0.86
Warren
0.83
shire
0.77
Burger
0.76
Activations Density 0.027%