INDEX
Explanations
mentions of actions taken by people in various situations, such as being arrested, receiving compliments, reading articles, and expressing displeasure
Negative Logits
$.     -0.66
%.     -0.59
};     -0.59
;}     -0.55
_.     -0.55
+.     -0.54
}.     -0.54
%;     -0.53
.;     -0.51
.</    -0.51
Positive Logits
regarding     0.79
osponsors     0.77
lately        0.74
nowadays      0.71
relating      0.66
ento          0.66
here          0.62
herein        0.62
Regarding     0.62
buquerque     0.61
Activations Density 0.902%