INDEX
Explanations
people's names in citations
New Auto-Interp
Negative Logits
fortiter
0.93
sostegno
0.90
figli
0.86
Phoen
0.81
conscientious
0.78
colp
0.78
suffrage
0.78
popolo
0.77
Scriptures
0.77
profession
0.76
POSITIVE LOGITS
แอ
1.46
1.44
کمپیو
1.43
အ
1.41
Ме
1.41
மெட்ரோ
1.39
Zhu
1.38
Zhang
1.38
তড়িৎ
1.37
อ
1.37
Activations Density 0.182%