INDEX
Explanations
social networking, gaming, texting
New Auto-Interp
Negative Logits
affirming
0.47
"--
0.41
leman
0.40
doi
0.38
ate
0.38
nations
0.37
iciencies
0.37
cultural
0.36
geography
0.36
gacchati
0.36
POSITIVE LOGITS
hazırl
0.47
ارزش
0.43
mitt
0.42
الحمل
0.40
endometri
0.40
आतंकियों
0.39
терро
0.39
modificaciones
0.39
verletzt
0.39
ایجاد
0.39
Activations Density 0.000%