Explanations
words related to outdoor trails or paths
mentions of trails and related outdoor activities
Negative Logits
Kamp        -0.69
Hague       -0.65
ately       -0.64
palp        -0.63
lesi        -0.62
ity         -0.61
Urs         -0.60
────────    -0.59
orporated   -0.58
tuition     -0.57
Positive Logits
bl          1.26
Blazers     1.16
head        1.00
roads       0.95
ways        0.94
heads       0.94
toe         0.89
runner      0.84
find        0.84
walker      0.83
Activations Density 0.026%