INDEX
Explanations
references to authoritative figures and their statements
New Auto-Interp
Negative Logits
vė
-0.50
tahankan
-0.48
læng
-0.48
hjelp
-0.47
tonode
-0.45
inspirasi
-0.45
jorden
-0.45
ſelves
-0.44
vrijwilli
-0.44
pittore
-0.44
POSITIVE LOGITS
ьаж
0.50
########.
0.48
0.46
➟
0.46
Officials
0.46
StatelessWidget
0.45
ceo
0.44
comunicado
0.44
officials
0.44
Dr
0.44
Activations Density 0.180%