Explanations
words related to field trips, outdoor activities, and research experiences
Negative Logits
Token         Logit
-Clause       -0.18
cka           -0.16
vit           -0.16
pher          -0.16
Vit           -0.15
itudes        -0.15
timeofday     -0.15
ocrat         -0.15
RuleContext   -0.14
itude         -0.14
Positive Logits
Token         Logit
work          0.25
trip          0.25
Marshal       0.21
ing           0.21
marshal       0.21
sob           0.21
Yates         0.21
hockey        0.20
trip          0.19
trips         0.19
Activations Density 0.010%