INDEX
Explanations
references to political and organizational communication
official letters and petitions
New Auto-Interp
Negative Logits
mişti
-0.31
uska
-0.31
<bos>
-0.30
temps
-0.29
-0.29
arany
-0.29
warnai
-0.27
PickerController
-0.26
سكانية
-0.26
parha
-0.25
POSITIVE LOGITS
protoimpl
0.67
Houſe
0.64
ſelves
0.60
Anſ
0.59
undersigned
0.58
Географија
0.58
ſtate
0.58
wiſe
0.58
ſou
0.58
点此举报
0.57
Activations Density 0.025%