INDEX
Explanations
certain verbs and expressions indicating ability, existence, and conditions related to actions or possessive states
Negative Logits
oad            -0.17
ione           -0.17
respectively   -0.16
ÐĦ             -0.15
esan           -0.15
themselves     -0.14
respective     -0.14
ª½             -0.14
ëł¤ê³ł         -0.14
Stars          -0.14
Positive Logits
embros         0.20
reu            0.18
yles           0.17
Ìĥ             0.16
egers          0.16
YLES           0.16
åĢij           0.16
ectors         0.15
ypes           0.15
ivals          0.15
Activations Density 0.370%