INDEX
Explanations
references to legal or formal documents and their components
New Auto-Interp
Negative Logits
OGND
-0.52
PerformLayout
-0.46
UserScript
-0.44
nakalista
-0.43
Примітки
-0.42
***!
-0.42
Aholisi
-0.41
stood
-0.40
autorytatywna
-0.40
Italijanski
-0.39
POSITIVE LOGITS
anymore
0.70
unless
0.59
enää
0.58
jemals
0.55
nor
0.53
żad
0.51
ninguém
0.50
quaisquer
0.49
because
0.48
任何人
0.48
Activations Density 0.146%