INDEX
Explanations
mentions of different states in a geopolitical context
references to geographical locations, particularly states and regions
New Auto-Interp
Negative Logits
Horowitz
-0.99
Pie
-0.87
Lennon
-0.84
Zup
-0.81
Hoffman
-0.80
Leo
-0.80
Dough
-0.79
Ogre
-0.78
Berger
-0.77
Pizza
-0.76
POSITIVE LOGITS
STATE
1.76
State
1.72
state
1.71
States
1.68
State
1.66
STATE
1.64
state
1.63
states
1.57
States
1.54
states
1.48
Activations Density 0.266%