INDEX
Explanations
phrases indicating adoption, popularity, or acceptance
phrases indicating activation or engagement with something
New Auto-Interp
Negative Logits
Ĥª
-0.72
msec
-0.61
Ķ
-0.60
BUT
-0.60
aurus
-0.59
714
-0.58
©¶æ¥µ
-0.57
Ĥİ
-0.54
cpp
-0.54
¿½
-0.53
POSITIVE LOGITS
behalf
1.29
erous
1.07
shore
1.07
etime
0.97
screen
0.91
top
0.86
eday
0.83
board
0.82
yx
0.80
demand
0.80
Activations Density 0.083%