INDEX
Explanations
references to rules and regulations
New Auto-Interp
Negative Logits
المعيارى
-0.51
rhetorical
-0.50
nocześnie
-0.49
Gruß
-0.48
Plank
-0.47
ramienta
-0.47
MockBean
-0.46
Grüße
-0.45
وارد
-0.45
셈
-0.45
POSITIVE LOGITS
book
1.03
governing
0.78
book
0.77
books
0.75
BOOK
0.74
Governing
0.74
Book
0.70
autorytatywna
0.70
GOVERN
0.67
thumb
0.66
Activations Density 0.271%