INDEX
Explanations
mentions of specific geographic locations, particularly national parks
references to bears and specific national parks
New Auto-Interp
Negative Logits
inen
-0.89
oa
-0.86
advoc
-0.83
ischer
-0.82
rums
-0.81
printf
-0.77
usc
-0.76
ohn
-0.76
estinal
-0.75
atta
-0.75
POSITIVE LOGITS
Yosemite
0.92
Grizz
0.85
grizz
0.84
Delta
0.83
Yellowstone
0.78
Grizzlies
0.76
footh
0.73
jumper
0.72
Doodle
0.69
delta
0.68
Activations Density 0.016%