INDEX
Explanations
references to statements or claims made in a legal or formal context
New Auto-Interp
Negative Logits
only
-0.47
[]
-0.47
чью
-0.44
Only
-0.44
only
-0.43
phie
-0.43
duduk
-0.42
Others
-0.42
ally
-0.41
encore
-0.41
POSITIVE LOGITS
resaid
0.97
Мексичка
0.95
aforesaid
0.94
ledit
0.84
الإنجليزية
0.83
AssemblyProduct
0.83
GOTREF
0.82
Попис
0.82
LookAnd
0.82
WriteBarrier
0.81
Activations Density 0.568%