INDEX
Explanations
phrases that express requests for feedback or assistance
New Auto-Interp
Negative Logits
ENEFITS
-0.51
ņas
-0.49
стоин
-0.49
ņa
-0.49
häls
-0.48
neſs
-0.48
XVI
-0.48
يلات
-0.47
junto
-0.46
USET
-0.46
POSITIVE LOGITS
GEBURTSDATUM
0.91
Diweddarwch
0.85
betweenstory
0.80
Gives
0.70
bewerken
0.68
Gives
0.66
चीज़ों
0.62
Personensuche
0.62
Giving
0.61
clusal
0.60
Activations Density 0.110%