INDEX
Explanations
modal verbs indicating possibility or capability
New Auto-Interp
Negative Logits
참고
-0.57
Chham
-0.56
aporan
-0.53
from
-0.53
throughout
-0.51
تفصیلات
-0.50
herself
-0.50
sendiri
-0.49
jinak
-0.49
usermodel
-0.48
POSITIVE LOGITS
anyone
1.39
they
1.38
we
1.36
anybody
1.28
you
1.10
it
0.99
anyone
0.98
ANYONE
0.95
she
0.92
someone
0.92
Activations Density 0.146%