INDEX
Explanations
references to personal and familial history or heritage
New Auto-Interp
Negative Logits
nincs
-0.46
میگو
-0.44
と思っています
-0.44
دانشنامهٔ
-0.43
maktadır
-0.41
dafx
-0.41
છે
-0.41
mohou
-0.40
Says
-0.40
んでいます
-0.39
POSITIVE LOGITS
was
2.61
were
1.82
wasn
1.75
had
1.72
seemed
1.55
było
1.55
was
1.49
buvo
1.49
did
1.48
wasnt
1.47
Activations Density 9.197%