INDEX
Explanations
motions or actions described using the gerund form
Negative Logits
©¶æ¥µ    -0.86
Ħ¢       -0.85
HAHA     -0.84
ĪĴ       -0.84
«        -0.82
LEY      -0.79
İĭ       -0.78
¿½       -0.77
Ĥ¬       -0.75
HAEL     -0.75
Positive Logits
ast      0.99
actory   0.98
asted    0.95
uzz      0.95
umbers   0.95
ickets   0.94
aunted   0.93
esides   0.93
umped    0.92
ynamic   0.91
Activations Density 0.077%