INDEX
Explanations
phrases indicating entering a place or starting an activity
repeated occurrences of the phrase "go in" with varying contexts
New Auto-Interp
Negative Logits
CLASSIFIED
-0.76
=]
-0.71
CVE
-0.69
STER
-0.62
Missing
-0.62
terday
-0.61
Percent
-0.61
Rest
-0.58
ende
-0.57
%"
-0.57
POSITIVE LOGITS
ordinate
1.15
front
1.02
humane
1.02
wards
1.01
vitro
1.01
bred
0.98
accordance
0.95
sole
0.95
bound
0.95
between
0.94
Activations Density 0.209%