INDEX
Explanations
girls' sexual content and refusal
New Auto-Interp
Negative Logits
pathname
1.14
웹
1.13
इस्लाम
1.06
scriptures
1.04
х
1.03
religious
1.03
parochial
1.01
deviation
1.00
समुद्री
0.99
የበ
0.99
POSITIVE LOGITS
erlä
0.98
Fakt
0.94
malah
0.93
Acht
0.92
Aug
0.92
senere
0.92
weiterhin
0.91
weniger
0.89
Ansch
0.89
Potenz
0.88
Activations Density 0.002%