INDEX
Explanations
temporal phrases and indications of timing
Negative Logits
Token       Logit
Trend       -0.71
Compare     -0.71
BIP         -0.67
YouTube     -0.66
arat        -0.66
aired       -0.66
aska        -0.65
compares    -0.65
esian       -0.64
aloud       -0.63
Positive Logits
Token           Logit
unsc            1.01
refreshed       0.87
reinforcements  0.86
unaccompanied   0.84
unprepared      0.78
smelling        0.75
escort          0.74
heels           0.73
undet           0.72
intending       0.71
Activations Density: 0.264%