INDEX
Explanations
phrases indicating causation or explanation
Negative Logits
мәкал          -0.67
GEBURTSDATUM   -0.60
ſche           -0.60
nakalista      -0.60
trapez         -0.59
mator          -0.58
ollectionView  -0.57
Arme           -0.57
Попис          -0.57
ſelves         -0.56
Positive Logits
because        0.46
mengingat      0.45
owing          0.41
cuarzo         0.39
Tatsache       0.38
reason         0.38
powodu         0.36
rodillas       0.36
disebabkan     0.36
是因为         0.36
Activations Density 0.088%