INDEX
Explanations
spiritual, social, norms, history
New Auto-Interp
Negative Logits
бонус
0.72
бону
0.67
пару
0.62
komplett
0.61
интерфей
0.61
ჩატი
0.60
формате
0.59
онлайн
0.59
невероят
0.59
малень
0.59
POSITIVE LOGITS
spiritual
0.64
spiritual
0.57
矛盾
0.53
XI
0.51
Spiritual
0.51
socio
0.49
важней
0.48
differentiated
0.47
社會
0.47
II
0.46
Activations Density 0.007%