INDEX
Explanations
names like Penny, Ruth, kid, fam
Negative Logits
amese    0.40
قات    0.39
ตร์    0.39
和我    0.39
cedure    0.39
ployed    0.38
inkler    0.38
ipotent    0.38
੍    0.38
teki    0.37
Positive Logits
શકાય    0.46
сега    0.45
competitively    0.43
қара    0.43
רא    0.42
الوزن    0.42
ليا    0.41
unisex    0.41
інтер    0.41
ita    0.41
Activations Density: 0.001%