INDEX
Explanations
mentions of parks or park-related activities
mentions of parks
Negative Logits
dilig     -0.83
decomp    -0.71
nces      -0.67
Qiao      -0.65
xit       -0.64
minded    -0.62
soever    -0.61
alloy     -0.58
oxid      -0.58
bestos    -0.58
Positive Logits
park         0.95
our          0.89
hurst        0.88
conservancy  0.87
walking      0.82
land         0.81
keepers      0.80
keeping      0.79
walk         0.79
walker       0.78
Activations Density: 0.020%