INDEX
Explanations
proper nouns starting with the letter 'T'
acronyms and abbreviations related to organizations or categories
New Auto-Interp
Negative Logits
quar
-0.73
sanct
-0.60
Nur
-0.58
STATS
-0.57
ket
-0.54
virtues
-0.54
spat
-0.54
540
-0.53
Mecca
-0.53
imov
-0.53
POSITIVE LOGITS
edo
1.07
oshenko
1.05
ulhu
0.92
flix
0.85
uyomi
0.81
tsky
0.79
hiba
0.78
dfx
0.78
asury
0.77
antage
0.74
Activations Density 0.136%