INDEX
Explanations
people expressing strong emotional responses or opinions about something
expressions of strong affection or appreciation
New Auto-Interp
Negative Logits
pload
-0.71
plur
-0.62
Journals
-0.61
ATURES
-0.59
countdown
-0.58
unanim
-0.57
ORTS
-0.57
Quantity
-0.56
prints
-0.56
mole
-0.56
POSITIVE LOGITS
entimes
0.83
nowadays
0.81
during
0.78
ingu
0.77
.
0.74
ðŁĺ
0.73
when
0.70
because
0.69
aws
0.69
anymore
0.69
Activations Density 0.117%