INDEX
Explanations
terms related to community engagement and social contributions
New Auto-Interp
Negative Logits
åĨĻ
-0.21
寫
-0.20
write
-0.19
writes
-0.17
écrit
-0.16
Write
-0.16
rush
-0.16
_written
-0.16
Tu
-0.16
iros
-0.15
POSITIVE LOGITS
talked
0.27
touched
0.24
spoke
0.23
briefly
0.19
Touch
0.19
touch
0.19
shared
0.19
mentioned
0.18
reminded
0.18
TOUCH
0.18
Activations Density 0.078%