INDEX
Explanations
words related to specific geographical locations, with a focus on California
references to the state of California and variations of the name Longhorns
New Auto-Interp
Negative Logits
theless
-0.89
tsky
-0.77
anwhile
-0.71
ueller
-0.65
combustion
-0.63
bed
-0.63
Loving
-0.62
guidance
-0.61
fetal
-0.60
isen
-0.60
POSITIVE LOGITS
ians
0.97
esse
0.88
enos
0.87
citiz
0.85
arians
0.82
Hots
0.81
esc
0.79
ian
0.77
oslov
0.76
escent
0.76
Activations Density 0.012%