INDEX
Explanations
mentions of the location "Los Angeles"
occurrences of the term "Los Angeles"
New Auto-Interp
Negative Logits
neut
-0.74
weak
-0.72
gen
-0.68
fertil
-0.67
fet
-0.66
trump
-0.65
weak
-0.63
quantum
-0.63
breaking
-0.62
preference
-0.61
POSITIVE LOGITS
Angeles
3.83
ANGEL
1.56
Aires
1.22
LAPD
1.14
Orleans
1.14
LA
1.14
Manila
1.13
Angels
1.10
Los
1.09
Angel
1.08
Activations Density 0.022%