INDEX
Explanations
time durations specified in hours
references to "an hour and a half" or variations thereof
New Auto-Interp
Negative Logits
antha
-0.68
Surviv
-0.67
ologically
-0.67
ãĥ¤
-0.65
eus
-0.64
士
-0.64
ocrats
-0.63
thumbnails
-0.63
Prev
-0.63
Reviewer
-0.62
POSITIVE LOGITS
dozen
0.79
entimes
0.71
hearted
0.69
glance
0.68
million
0.67
heartedly
0.67
quel
0.65
sized
0.64
gallon
0.63
overs
0.63
Activations Density 0.016%