INDEX
Explanations
direct references to movie titles within text
quotation marks
Negative Logits
grasp             -0.86
care              -0.86
disciplinary      -0.80
matter            -0.80
delivery          -0.80
grips             -0.79
grav              -0.77
batter            -0.77
outfielder        -0.77
methodological    -0.77
Positive Logits
Eat               1.36
Morning           1.35
Operation         1.30
Dear              1.30
Big               1.28
Bad               1.27
Friends           1.26
Golden            1.26
Untitled          1.25
Saturday          1.23
Activations Density: 0.100%