INDEX
Explanations
references to hobbies or hobby-related activities
references to hobbies and leisure activities
New Auto-Interp
Negative Logits
curv
-0.67
Ezekiel
-0.65
Africans
-0.63
waves
-0.63
walls
-0.63
subtitles
-0.62
sclerosis
-0.62
oti
-0.61
AIDS
-0.59
Saints
-0.59
POSITIVE LOGITS
hobby
1.47
sonian
1.00
adelphia
0.94
icion
0.91
puter
0.88
reprene
0.84
arthed
0.84
conservancy
0.82
hobbies
0.81
finder
0.81
Activations Density 0.011%