INDEX
Explanations
mentions of specific durations of time, particularly those related to sentences or imprisonments
occurrences of the word "in" followed by various contexts
Negative Logits
NOW           -0.78
hirt          -0.71
ALSE          -0.68
mu            -0.67
pointers      -0.65
bryce         -0.65
tons          -0.64
DOS           -0.64
SourceFile    -0.63
deck          -0.63
Positive Logits
advance        1.08
ordinate       1.00
accordance     1.00
preparation    0.96
lieu           0.94
conjunction    0.93
vitro          0.93
relation       0.91
order          0.91
patient        0.91
Activations Density 0.255%
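The density figure above is usually read as the fraction of token positions on which the feature fires. The sketch below illustrates that reading only; it assumes density means "share of positions with activation above zero", which this page does not state. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature is active. The page itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 20,000 token positions, 51 of them active.
sample = [0.0] * 19949 + [1.0] * 51
density = activation_density(sample)

# Format as a percentage with three decimals, as on the page.
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.255% would correspond to roughly 1 active position in every 390 tokens.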