Explanations
increasingly complex descriptions
Negative Logits
Token        Value
ن            1.05
kiosks       0.98
fondos       0.96
atrocities   0.93
naudoj       0.92
ireo         0.91
biomarkers   0.90
linkages     0.90
concerts     0.89
hostilities  0.89
Positive Logits
Token        Value
ingly        1.13
на           1.02
hafte        0.93
haften       0.86
дку          0.85
ik           0.84
씩           0.82
FULL         0.81
д            0.80
Në           0.80
Activations Density 0.077%