INDEX
Explanations
phrases indicating personal reflection and intention
Negative Logits
AssemblyCulture    -1.02
IntoConstraints    -0.95
Мексичка           -0.87
expandindo         -0.86
виправивши         -0.85
Portály            -0.84
Autoritní          -0.82
Italijanski        -0.82
دانشنامهٔ          -0.81
Portale            -0.77
Positive Logits
hit     0.46
kaç     0.45
";      0.45
If      0.44
my      0.43
"=>     0.43
</u>    0.43
will    0.43
my      0.43
'       0.42
Activations Density 0.125%