INDEX
Explanations
names of individuals or organizations
abbreviations or initials of organizations and people
Negative Logits
dracon      -0.69
jriwal      -0.62
FORMATION   -0.59
£ı          -0.53
perfect     -0.51
surv        -0.50
subp        -0.50
carbohyd    -0.50
podium      -0.50
etheless    -0.49
Positive Logits
ich         0.73
uzz         0.73
cia         0.71
oni         0.71
ansky       0.70
ani         0.69
ia          0.68
illo        0.68
nik         0.67
jan         0.67
Activations Density 0.497%