INDEX
Explanations
sharing knowledge and personal contributions
New Auto-Interp
Negative Logits
Ж
0.61
З
0.61
Са
0.57
Ро
0.55
你
0.55
Д
0.55
ρο
0.55
Το
0.54
О
0.54
poly
0.54
POSITIVE LOGITS
berbagi
0.96
compartilhar
0.93
sharing
0.89
condiv
0.88
share
0.87
Sharing
0.84
compartilh
0.82
conmigo
0.81
공유
0.80
conosco
0.78
Activations Density 0.033%