INDEX
Explanations
phrases related to immediacy or urgency
expressions of frequency or intensity in actions or feelings
New Auto-Interp
Negative Logits
istries
-0.83
mare
-0.78
olas
-0.75
wr
-0.72
throw
-0.69
uffer
-0.67
irez
-0.67
aw
-0.66
estern
-0.66
SHIP
-0.65
POSITIVE LOGITS
progresses
0.75
progressed
0.71
standalone
0.68
NPR
0.67
confir
0.66
precaution
0.66
adjunct
0.64
redes
0.63
elaborated
0.63
result
0.62
Activations Density 0.170%